ACM Home Page
Please provide us with feedback. Feedback
Why off-the-shelf RDBMSs are better at XPath than you might expect
Full text PdfPdf (270 KB)
Source
International Conference on Management of Data archive
Proceedings of the 2007 ACM SIGMOD international conference on Management of data table of contents
Beijing, China
SESSION: XML table of contents
Pages: 949 - 958  
Year of Publication: 2007
ISBN:978-1-59593-686-8
Authors
Torsten Grust  Technische Universität München, Munich, Germany
Jan Rittinger  Technische Universität München, Munich, Germany
Jens Teubner  Technische Universität München, Munich, Germany
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 101,   Citation Count: 7
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1247480.1247591
What is a DOI?

ABSTRACT

To compensate for the inherent impedance mismatch between the relational data model (tables of tuples) and XML (ordered, unranked trees), tree join algorithms have become the prevalent means to process XML data in relational databases, most notably the TwigStack[6], structural join[1], and staircase join[13] algorithms. However, the addition of these algorithms to existing systems depends on a significant invasion of the underlying database kernel, an option intolerable for most database vendors.

Here, we demonstrate that we can achieve comparable XPath performance without touching the heart of the system. We carefully exploit existing database functionality and accelerate XPath navigation by purely relational means: partitioned B-trees bring access costs to secondary storage to a minimum, while aggregation functions avoid an expensive computation and removal of duplicate result nodes to comply with the XPath semantics. Experiments carried out on IBM DB2 confirm that our approach can turn off-the-shelf database systems into efficient XPath processors.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Shurug Al-Khalifa, H. V. Jagadish, Jignesh M. Patel, Yuqing Wu, Nick Koudas, and Divesh Srivastava. Structural Joins: A Primitive for Efficient XML Query Pattern Matching. In Proc. of the 18th Int'l Conference on Data Engineering (ICDE), San Jose, CA, USA, February 2002.
2
 
3
Anders Berglund, Scott Boag, Don Chamberlin, Mary F. Fernández, Michael Kay, Jonathan Robie, and Jérôme Siméon. XML Path Language (XPath) 2.0. World Wide Web Consortium Candidate Recommendation, September 2005.
 
4
Scott Boag, Don Chamberlin, Mary F. Fernández, Daniela Florescu, Jonathan Robie, and Jérôme Siméon. XQuery 1.0: An XML Query Language. World Wide Web Consortium Candidate Recommendation, September 2005.
5
6
7
8
 
9
Goetz Graefe. Sorting and Indexing with Partitioned B-Trees. In Proc. of the 1st Int'l Conference on Innovative Data Systems Research (CIDR), Asilomar, CA, USA, January 2003.
10
11
 
12
Torsten Grust, Sherif Sakr, and Jens Teubner. XQuery on SQL Hosts. In Proc. of the 30th Int'l Conference on Very Large Databases (VLDB), Toronto, Canada, September 2004.
 
13
Torsten Grust, Maurice van Keulen, and Jens Teubner. Staircase Join: Teach a Relational DBMS to Watch its (Axis) Steps. In Proc. of the 29th Int'l Conference on Very Large Databases (VLDB), Berlin, Germany, September 2003.
 
14
Haifeng Jiang, Hongjun Lu, Wei Wang, and Beng Chin Ooi. XR-Tree: Indexing XML Data for Efficient Structural Joins. In Proc. of the 19th Int'l Conference on Data Engineering (ICDE), Bangalore, India, March 2003.
 
15
 
16
 
17
Microsoft Corporation. SQL Server 2005, 2005.
18
 
19
Shankar Pal, Istvan Cseri, Oliver Seeliger, Gideon Schaller, Leo Giakoumakis, and Vasili Zolotov. Indexing XML Data Stored in a Relational Database. In Proc. of the 30th Int'l Conference on Very Large Databases (VLDB), Toronto, Canada, September 2004.
 
20
 
21
Albrecht Schmidt, Florian Waas, Martin L. Kersten, Michael J. Carey, Ioana Manolescu, and Ralph Busse. XMark: A Benchmark for XML Data Management. In Proc. of the 28th Int'l Conference on Very Large Databases (VLDB), Hong Kong, China, August 2002.
 
22
23
24


Collaborative Colleagues:
Torsten Grust: colleagues
Jan Rittinger: colleagues
Jens Teubner: colleagues