ACM Home Page
Please provide us with feedback. Feedback
MonetDB/XQuery: a fast XQuery processor powered by a relational engine
Full text PdfPdf (606 KB)
Source International Conference on Management of Data archive
Proceedings of the 2006 ACM SIGMOD international conference on Management of data table of contents
Chicago, IL, USA
SESSION: Query processing for XML data table of contents
Pages: 479 - 490  
Year of Publication: 2006
ISBN:1-59593-434-0
Authors
Peter Boncz  CWI Amsterdam, The Netherlands
Torsten Grust  Technische Universität München, Germany
Maurice van Keulen  University of Twente, The Netherlands
Stefan Manegold  CWI Amsterdam, The Netherlands
Jan Rittinger  Technische Universität München, Germany
Jens Teubner  Technische Universität München, Germany
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 35,   Downloads (12 Months): 144,   Citation Count: 28
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1142473.1142527
What is a DOI?

ABSTRACT

Relational XQuery systems try to re-use mature relational data management infrastructures to create fast and scalable XML database technology. This paper describes the main features, key contributions, and lessons learned while implementing such a system. Its architecture consists of (i) a range-based encoding of XML documents into relational tables, (ii) a compilation technique that translates XQuery into a basic relational algebra, (iii) a restricted (order) property-aware peephole relational query optimization strategy, and (iv) a mapping from XML update statements into relational updates. Thus, this system implements all essential XML database functionalities (rather than a single feature) such that we can learn from the full consequences of our architectural decisions. While implementing this system, we had to extend the state-of-the-art with a number of new technical contributions, such as loop-lifted staircase join and efficient relational query evaluation strategies for XQuery theta-joins with existential semantics. These contributions as well as the architectural lessons learned are also deemed valuable for other relational back-end engines. The performance and scalability of the resulting system is evaluated on the XMark benchmark up to data sizes of 11GB. The performance section also provides an extensive benchmark comparison of all major XMark results published previously, which confirm that the goal of purely relational XQuery processing, namely speed and scalability, was met.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Berkeley DB XML. http://www.sleepycat.com/products/bdbxml.html.
3
 
4
P. Boncz. Monet: A Next-Generation DBMS Kernel For Query-Intensive Applications. PhD thesis, UVA, 2002.
 
5
P. Boncz et al. Pathfinder: Relational XQuery Over Multi-Gigabyte XML Inputs In Interactive Time. Technical Report INS-E0503, CWI, 2005.
 
6
P. Boncz, S. Manegold, and J. Rittinger. Updating the Pre/Post Plane in MonetDB/XQuery. In XIME-P, 2005.
7
 
8
 
9
M. Carey, D. Florescu, Z. Ives, Y. Lu, S. J., E. Shekita, and S. S. XPERANTO: Publishing Object-Relational Data as XML. In WebDB, 2000.
 
10
D. Chamberlin, D. Florescu, and J. Robie. XQuery Update Facility. W3C, 2006. http://www.w3.org/TR/xqupdate/.
11
12
 
13
A. Deutsch and V. Tannen. MARS: A System for Publishing XML from Mixed and Redundant Storage. In VLDB Conf., 2003.
 
14
M. Fernández, J. Siméon, B. Choi, A. Marian, and G. Sur. Implementing XQuery 1.0: The Galax Experience. In VLDB Conf., 2003.
 
15
D. Florescu et al. The BEA/XQRL Streaming XQuery Processor. In VLDB Conf., 2003.
 
16
T. Grust. Purely Relational FLWORs. In Proc. XIME-P, 2005.
 
17
T. Grust, S. Sakr, and J. Teubner. XQuery on SQL Hosts. In VLDB Conf., 2004.
 
18
T. Grust, M. van Keulen, and J. Teubner. Staircase Join: Teach a Relational DBMS to Watch its (Axis) Steps. In VLDB Conf., 2003.
19
 
20
IPSI-XQ. http://ipsi.fhg.de/oasys/projects/ipsi-xq/.
 
21
 
22
C. Koch, S. Scherzinger, N. Schweikardt, and B. Stegmaier. Schema-based Scheduling of Event Processors and Buffer Minimization for Queries on Structured Data Streams. In VLDB Conf., 2004.
 
23
R. Krishnamurthy, R. Kaushik, and J. Naughton. XML-to-SQL Query Translation Literature: The State of the Art and Open Problems. In Proc. XSym, 2003.
 
24
Kweelt. http://kweelt.sourceforge.net/.
 
25
 
26
27
 
28
 
29
G. Moerkotte and T. Neumann. A Combined Framework for Grouping and Order Optimization. In VLDB Conf., 2004.
30
 
31
 
32
 
33
Qizx/open. http://www.xfra.net/qizxopen/.
 
34
QuiP. http://developer.softwareag.com/tamino/quip/.
 
35
Saxon. http://saxon.sourceforge.net/.
 
36
A. Schmidt, F. Waas, M. Kersten, M. Carey, I. Manolescu, and R. Busse. XMark: A Benchmark for XML Data Management. In VLDB Conf., 2002.
37
38
 
39
X. Wang and M. Cherniack. Avoiding Sorting and Grouping in Processing Queries. In VLDB Conf., 2003.
 
40
X-Hive/DB. http://www.x-hive.com/.
41

CITED BY  28

Collaborative Colleagues:
Peter Boncz: colleagues
Torsten Grust: colleagues
Maurice van Keulen: colleagues
Stefan Manegold: colleagues
Jan Rittinger: colleagues
Jens Teubner: colleagues