|
ABSTRACT
The W3C XQuery language recommendation, based on a hierarchical and ordered document model, supports a wide variety of constructs and use cases. There is a diversity of approaches and strategies for evaluating XQuery expressions, in many cases only dealing with limited subsets of the language. In this paper we describe an implementation approach that handles XQuery with arbitrarily-nested FLWR expressions, element constructors and built-in functions (including structural comparisons). Our proposal maps an XQuery expression to a single equivalent SQL query using a novel dynamic interval encoding of a collection of XML documents as relations, augmented with information tied to the query evaluation environment. The dynamic interval technique enables (suitably enhanced) relational engines to produce predictably good query plans that do not preclude the use of sort-merge join query operators. The benefits are realized despite the challenges presented by intermediate results that create arbitrary documents and the need to preserve document order as prescribed by semantics of XQuery. Finally, our experimental results demonstrate that (native or relational) XML systems can benefit from the above technique to avoid a quadratic scale up penalty that effectively prevents the evaluation of nested FLWR expressions for large documents.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Galax. Available from http://db.bell-labs.com/galax/.
|
| |
2
|
IPSI-XQ - the XQuery demonstrator. Available from http://ipsi.fhg.de/oasys/projects/ipsi-xq/.
|
| |
3
|
Kweelt. Available from http://kweelt.sourceforge.net.
|
| |
4
|
QuiP. Available from http://developer.softwareag.com/ tamino/quip/.
|
| |
5
|
X-Hive/DB. Available from http://www.x-hive.com.
|
| |
6
|
XMark -- an XML benchmark project. Available from http://www.xml-benchmark.org.
|
| |
7
|
V. Aguilera, S. Cluet, P. Veltri, D. Vodislav, and F. Wattez. Querying XML documents in Xyleme. In Proc. ACM SIGIR Workshop on XML and Information Retrieval, 2000.
|
| |
8
|
|
| |
9
|
D. Barbosa, A. Barta, A. O. Mendelzon, G. A. Mihaila, F. Rizzolo, and P. Rodriguez-Guianolli. ToX - The Toronto XML Engine. In Proc. Workshop on Information Integration on the Web, pages 66--73, 2001.
|
| |
10
|
S. Boag, D. Chamberlin, D. Florescu, J. Robie, J. Simeon, and M. Stefanescu. XQuery 1.0: An XML Query Language. Technical report, W3C, 2001.
|
| |
11
|
|
| |
12
|
L. J. Brown , M. P. Consens , I. J. Davis , C. R. Palmer , F. W. Tompa, A structured text ADT for object-relational databases, Theory and Practice of Object Systems, v.4 n.4, p.227-244, Oct. 12, 1998
[doi> 10.1002/(SICI)1096-9942(1998)4:4<227::AID-TAPO3>3.3.CO;2-L]
|
 |
13
|
|
| |
14
|
|
| |
15
|
J. Celko. Trees, Databases and SQL. DBMS, 7(10):48--57, 1994.
|
| |
16
|
|
| |
17
|
D. Chamberlin, P. Fankhauser, M. Marchiori, and J. Robie. XML Query Use Cases. Technical report, W3C, 2001.
|
| |
18
|
Y. Chen, G. Mihaila, S. Padmanabhan, and R. Bordawekar. Labeling Your XML. (preliminary version presented at CASCON'02 Conf.), October 2002.
|
 |
19
|
|
 |
20
|
|
 |
21
|
|
| |
22
|
P. Fankhauser, M. Fernandez, A. Malhotra, M. Rys, J. Simeon, and P. Wadler. XQuery 1.0 Formal Semantics. Technical report, W3C, 2001.
|
 |
23
|
|
| |
24
|
|
| |
25
|
D. Florescu and D. Kossmann. Storing and Querying XML Data using an RDMBS. IEEE Data Engineering Bulletin, 22(3):27--34, 1999.
|
| |
26
|
|
 |
27
|
|
| |
28
|
Jayavel Shanmugasundaram , Eugene J. Shekita , Rimon Barr , Michael J. Carey , Bruce G. Lindsay , Hamid Pirahesh , Berthold Reinwald, Efficiently Publishing Relational Data as XML Documents, Proceedings of the 26th International Conference on Very Large Data Bases, p.65-76, September 10-14, 2000
|
| |
29
|
Jayavel Shanmugasundaram , Kristin Tufte , Chun Zhang , Gang He , David J. DeWitt , Jeffrey F. Naughton, Relational Databases for Querying XML Documents: Limitations and Opportunities, Proceedings of the 25th International Conference on Very Large Data Bases, p.302-314, September 07-10, 1999
|
 |
30
|
Richard Thomas Snodgrass , Ilsoo Ahn , Gadi Ariav , Don Batory , James Clifford , Curtis E. Dyreson , Ramez Elmasri , Fabio Grandi , Christian S. Jensen , Wolfgang Käfer , Nick Kline , Krishna Kulkarni , T. Y. Cliff Leung , Nikos Lorentzos , John F. Roddick , Arie Segev , Michael D. Soo , Suryanarayana M. Sripada, TSQL2 language specification, ACM SIGMOD Record, v.23 n.1, p.65-86, March 1994
[doi> 10.1145/181550.181562]
|
 |
31
|
Igor Tatarinov , Stratis D. Viglas , Kevin Beyer , Jayavel Shanmugasundaram , Eugene Shekita , Chun Zhang, Storing and querying ordered XML using a relational database system, Proceedings of the 2002 ACM SIGMOD international conference on Management of data, June 03-06, 2002, Madison, Wisconsin
[doi> 10.1145/564691.564715]
|
| |
32
|
F. Tian, D. DeWitt, J. Chen, and C. Zhang. The design and performance evaluation of alternative XML storage strategies. Technical report, University of Wisconsin, 2002.
|
| |
33
|
|
| |
34
|
D. Toman and G. E. Weddell. Querying XML: On the Utility of Interval Encoding. Technical report, University of Waterloo, 2002.
|
 |
35
|
|
 |
36
|
Chun Zhang , Jeffrey Naughton , David DeWitt , Qiong Luo , Guy Lohman, On supporting containment queries in relational database management systems, Proceedings of the 2001 ACM SIGMOD international conference on Management of data, p.425-436, May 21-24, 2001, Santa Barbara, California, United States
|
CITED BY 34
|
|
|
|
|
|
|
|
Yi Chen , George A. Mihaila , Susan B. Davidson , Sriram Padmanabhan, EXPedite: a system for encoded XML processing, Proceedings of the thirteenth ACM international conference on Information and knowledge management, November 08-13, 2004, Washington, D.C., USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Wenfei Fan , Jeffrey Xu Yu , Hongjun Lu , Jianhua Lu , Rajeev Rastogi, Query translation from XPATH to SQL in the presence of recursive DTDs, Proceedings of the 31st international conference on Very large data bases, August 30-September 02, 2005, Trondheim, Norway
|
|
|
Kevin Beyer , Roberta J. Cochrane , Vanja Josifovski , Jim Kleewein , George Lapis , Guy Lohman , Bob Lyle , Fatma Özcan , Hamid Pirahesh , Normen Seemann , Tuong Truong , Bert Van der Linden , Brian Vickery , Chun Zhang, System RX: one part relational, one part XML, Proceedings of the 2005 ACM SIGMOD international conference on Management of data, June 14-16, 2005, Baltimore, Maryland
|
|
|
William M. Shui , Franky Lam , Damien K. Fisher , Raymond K. Wong, Querying and maintaining ordered XML data using relational databases, Proceedings of the sixteenth Australasian database conference, p.85-94, January 01, 2005, Newcastle, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
K. Beyer , R. Cochrane , M. Hvizdos , V. Josifovski , J. Kleewein , G. Lapis , G. Lohman , R. Lyle , M. Nicola , F. Özcan , H. Pirahesh , N. Seemann , A. Singh , T. Truong , R. C. Van der Linden , B. Vickery , C. Zhang , G. Zhang, DB2 goes hybrid: integratng native XML and XQuery with relational data and SQL, IBM Systems Journal, v.45 n.2, p.271-298, January 2006
|
|
|
Markus Kirchberg , Faizal Riaz-ud-Din , Klaus-Dieter Schewe , Alexei Tretiakov, Using reflection for querying XML documents, Proceedings of the 17th Australasian Database Conference, p.119-128, January 16-19, 2006, Hobart, Australia
|
|
|
Peter Boncz , Torsten Grust , Maurice van Keulen , Stefan Manegold , Jan Rittinger , Jens Teubner, MonetDB/XQuery: a fast XQuery processor powered by a relational engine, Proceedings of the 2006 ACM SIGMOD international conference on Management of data, June 27-29, 2006, Chicago, IL, USA
|
|
|
|
|
|
Yifeng Zheng , Stephen Fisher , Shirley Cohen , Sheng Guo , Junhyong Kim , Susan B. Davidson, Crimson: a data management system to support evaluating phylogenetic tree reconstruction algorithms, Proceedings of the 32nd international conference on Very large data bases, September 12-15, 2006, Seoul, Korea
|
|
|
|
|
|
|
|
|
Zhimin Chen , H. V. Jagadish , Laks V. S. Lakshmanan , Stelios Paparizos, From tree patterns to generalized tree patterns: on efficient evaluation of XQuery, Proceedings of the 29th international conference on Very large data bases, p.237-248, September 09-12, 2003, Berlin, Germany
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Guangjun Xie , Qi Cheng , Jarek Gryz , Calisto Zuzarte, Some rewrite optimizations of DB2 XQuery navigation, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
|
|