ACM Home Page
Please provide us with feedback. Feedback
Efficient algorithms for exact ranked twig-pattern matching over graphs
Full text PdfPdf (615 KB)
Source
International Conference on Management of Data archive
Proceedings of the 2008 ACM SIGMOD international conference on Management of data table of contents
Vancouver, Canada
SESSION: Research Session 13: Graphs II table of contents
Pages 581-594  
Year of Publication: 2008
ISBN:978-1-60558-102-6
Authors
Gang Gou  North Carolina State University, Raleigh, NC, USA
Rada Chirkova  North Carolina State University, Raleigh, NC, USA
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 22,   Downloads (12 Months): 252,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1376616.1376676
What is a DOI?

ABSTRACT

Querying large-scale graph-structured data with twig patterns is attracting growing interest. Generally, a twig pattern could have an extremely large, potentially exponential, number of matches in a graph. Retrieving and returning to the user this many answers may both incur high computational overhead and overwhelm the user.

In this paper we propose two efficient algorithms, DP-B and DP-P, for retrieving top-ranked twig-pattern matches from large graphs. Our first algorithm, DP-B, is able to retrieve exact top-ranked answer matches from potentially exponentially many matches in time and space linear in the size of our data inputs even in the worst case. Further, beyond the linear-cost result of DP-B, our second algorithm, DP-P, could take far less than linear time and space cost in practice. To the best of our knowledge, our algorithms are the first to have these performance properties. Our experimental results demonstrate the high performance of both algorithms on large datasets. We also analyze and compare the performance trade-off between DP-B and DP-P from the theoretical and practical viewpoints.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
S. Agrawal, S. Chaudhuri, and G. Das. DBXplorer: A system for keyword-based search over relational databases. ICDE, 2002.
3
 
4
5
 
6
 
7
 
8
 
9
B. Dawes and D. Abrahams. Boost C++ Libraries. http://www.boost.org/.
 
10
DBLP. DBLP bibliography. http://dblp.uni-trier.de/xml/.
 
11
B. Ding, J. X. Yu, S. Wang, L. Qin, X. Zhang, and X. Lin. Finding top-k min-cost connected trees in databases. ICDE, 2007.
12
 
13
14
15
16
 
17
 
18
19
 
20
Oracle. Oracle Berkeley DB Java Edition (Version 3.2.44). http://www.oracle.com/technology/products/berkeley-db/ je/index.html.
 
21
 
22
 
23
W3C. XML path language (XPath) Version 1.0. http://www.w3.org/TR/xpath, 1999.
 
24
W3C. Resource Description framework (RDF). http://www.w3.org/RDF/, 2004.
 
25