ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Covering indexes for branching path queries
Full text PdfPdf (1.37 MB)
Source International Conference on Management of Data archive
Proceedings of the 2002 ACM SIGMOD international conference on Management of data table of contents
Madison, Wisconsin
SESSION: Research sessions: path indexing table of contents
Pages: 133 - 144  
Year of Publication: 2002
ISBN:1-58113-497-5
Authors
Raghav Kaushik  University of Wisconsin
Philip Bohannon  Bell Laboratories
Jeffrey F Naughton  University of Wisconsin
Henry F Korth  Bell Laboratories
Sponsor
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 33,   Citation Count: 64
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/564691.564707
What is a DOI?

ABSTRACT

In this paper, we ask if the traditional relational query acceleration techniques of summary tables and covering indexes have analogs for branching path expression queries over tree- or graph-structured XML data. Our answer is yes --- the forward-and-backward index already proposed in the literature can be viewed as a structure analogous to a summary table or covering index. We also show that it is the smallest such index that covers all branching path expression queries. While this index is very general, our experiments show that it can be so large in practice as to offer little performance improvement over evaluating queries directly on the data. Likening the forward-and-backward index to a covering index on all the attributes of several tables, we devise an index definition scheme to restrict the class of branching path expressions being indexed. The resulting index structures are dramatically smaller and perform better than the full forward-and-backward index for these classes of branching path expressions. This is roughly analogous to the situation in multidimensional or OLAP workloads, in which more highly aggregated summary tables can service a smaller subset of queries but can do so at increased performance. We evaluate the performance of our indexes on both relational decompositions of XML and a native storage technique. As expected, the performance benefit of an index is maximized when the query matches the index definition.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Xmark: The xml benchmark project. http://monetdb.cwi.nl/xml/index.html.
 
2
 
3
J. Clark and S. DeRose. XML path language (XPath) 1.0. W3C recommendation. World Wide Web Consortium, http://www.w3.org/TR/xpath, Nov. 1999.
 
4
 
5
S. DeRose, E. Maler, and D. Orchard. The XLink standard. World Wide Web Consortium, http ://www.w3.org/TR/xquery, Nov. 1999.
 
6
 
7
R. Goldman and J. Widom. Approximate DataGuides. In Proc. of the Workshop on Query Processing for Semistructured Data and Non-Standard Data Formats, pages 436-445, January 1999.
8
 
9
 
10
R. Milner. A Calculus for Communicating Processes, volume 92 of Lecture Notes in Computer Science. Springer Verlag, 1980.
 
11
 
12
 
13
 
14
 
15
Open Directory Project. DMOZ open directory project. http://www.dmoz.org.
 
16
F. Rizzolo and A. Mendelzon. Indexing XML data with ToXin. In Proc. of WebDB 2001, 2001.
 
17
R. Kaushik, P. Shenoy, P. Bohannon, and E. Gudes. Exploiting local similarity for efficient indexing of paths in graph structured data. In Proceedings of ICDE, 2002.
 
18
19
20

CITED BY  64

Collaborative Colleagues:
Raghav Kaushik: colleagues
Philip Bohannon: colleagues
Jeffrey F Naughton: colleagues
Henry F Korth: colleagues