| What makes the differences: benchmarking XML database implementations |
| Full text |
Pdf
(589 KB)
|
| Source
|
ACM Transactions on Internet Technology (TOIT)
archive
Volume 5 , Issue 1 (February 2005)
table of contents
Pages: 154 - 194
Year of Publication: 2005
ISSN:1533-5399
|
|
Authors
|
|
Hongjun Lu
|
The Hong Kong University of Science and Technology, Hong Kong, China
|
|
Jeffrey Xu Yu
|
The Chinese University of Hong Kong, Hong Kong, China
|
|
Guoren Wang
|
Northeastern University, Shenyang, China
|
|
Shihui Zheng
|
Fudan University, Shanghai, China
|
|
Haifeng Jiang
|
The Hong Kong University of Science and Technology, Hong Kong, China
|
|
Ge Yu
|
Northeastern University, Shenyang, China
|
|
Aoying Zhou
|
Fudan University, Shanghai, China
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 46, Downloads (12 Months): 301, Citation Count: 3
|
|
|
ABSTRACT
XML is emerging as a major standard for representing data on the World Wide Web. Recently, many XML storage models have been proposed to manage XML data. In order to assess an XML database's abilities to deal with XML queries, several benchmarks have also been proposed, including XMark and XMach. However, no reported studies using those benchmarks were found that can provide users with insights on the impacts of a variety of storage models on XML query performance. In this article, we report our first set of results on benchmarking a set of XML database implementations using two XML benchmarks. The selected implementations represent a wide range of approaches, including RDBMS-based systems with document-independent and document-dependent XML-relational schema mapping approaches, and XML native engines based on an Object-Oriented Model and the Document Object Model. Comprehensive experiments were conducted to study relative performance of different approaches and the important issues that affect XML query performance, such as path expression query processing, effectiveness of various partitioning, label-path, and indexing structures.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Abiteboul, S., Quass, D., McHugh, J., Widom, J., and Wiener, J. L. 1997. The Lorel query language for semistructured data. International Journal on Digital Libraries 1, 1, 68--88.
|
| |
2
|
|
| |
3
|
Berglund, A., Boag, S., Chamberlin, D., Fernandez, M. F., Kay, M., Robie, J., and Simeon, J. 2002. XML path language (XPath) 2.0. Tech. rep.
|
| |
4
|
Boag, S., Chamberlin, D., Fernandez, M. F., Florescu, D., Robie, J., and Simeon, J. 2002. XQuery 1.0: An XML query language. In W3C Working Draft 16 August 2002.
|
| |
5
|
|
 |
6
|
|
| |
7
|
|
| |
8
|
Stefano Ceri , Sara Comai , Ernesto Damiani , Piero Fraternali , Stefano Paraboschi , Letizia Tanca, XML-GL: a graphical language for querying and restructuring XML documents, Proceeding of the eighth international conference on World Wide Web, p.1171-1187, May 1999, Toronto, Canada
|
| |
9
|
|
| |
10
|
Chien, S. Y., Vagena, Z., Zhang, D., Tsotras, V., and Zaniolo, C. 2002. Efficient structural joins on indexed XML documents. In Proceedings of the 28th International Conference on Very Large Data Bases. Hong Kong, China. 263--274.
|
 |
11
|
V. Christophides , S. Abiteboul , S. Cluet , M. Scholl, From structured documents to novel query facilities, Proceedings of the 1994 ACM SIGMOD international conference on Management of data, p.313-324, May 24-27, 1994, Minneapolis, Minnesota, United States
|
| |
12
|
Alin Deutsch , Mary Fernandez , Daniela Florescu , Alon Levy , Dan Suciu, A query language for XML, Proceeding of the eighth international conference on World Wide Web, p.1155-1169, May 1999, Toronto, Canada
|
 |
13
|
Alin Deutsch , Mary Fernandez , Dan Suciu, Storing semistructured data with STORED, Proceedings of the 1999 ACM SIGMOD international conference on Management of data, p.431-442, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
|
 |
14
|
Mary Fernández , Daniela Florescu , Jaewoo Kang , Alon Levy , Dan Suciu, Catching the boat with Strudel: experiences with a Web-site management system, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.414-425, June 01-04, 1998, Seattle, Washington, United States
|
| |
15
|
Florescu, D. and Kossmann, D. 1999. A performance evaluation of alternative mapping schemes for storing XML data in a relational database. Survey report.
|
 |
16
|
|
| |
17
|
Haifeng Jiang , Hongjun Lu , Wei Wang , Jeffrey Xu Yu, Path materialization revisited: an efficient storage model for XML data, Proceedings of the thirteenth Australasian database conference, p.85-94, January 01, 2002, Melbourne, Victoria, Australia
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
|
 |
22
|
|
| |
23
|
|
 |
24
|
Hongjun Lu , Guoren Wang , Ge Yu , Yubin Bao , Jianhua Lv , Yaxin Yu, XBase: making your gigabyte disk queriable, Proceedings of the 2002 ACM SIGMOD international conference on Management of data, June 03-06, 2002, Madison, Wisconsin
[doi> 10.1145/564691.564785]
|
| |
25
|
Jianhua Lv , Guoren Wang , Jeffrey Xu Yu , Ge Yu , Hongjun Lu , Bing Sun, Performance Evaluation of a DOM-Based XML Database: Storage, Indexing and Query Optimization, Proceedings of the Third International Conference on Advances in Web-Age Information Management, p.13-24, August 11-13, 2002
|
 |
26
|
|
| |
27
|
|
| |
28
|
Schmidt, A., Waas, F., Kersten, M., Carey, M. J., Manolescu, I., and Busse, R. 2002. Xmark: A benchmark for XML data management. In Proceedings of the 28th International Conference on Very Large Data Bases. (Hong Kong, China). 974--985.
|
| |
29
|
A. R. Schmidt , Florian Waas , Martin L. Kersten , D. Florescu , I. Manolescu , M. J. Carey , R. Busse, The XML benchmark project, CWI (Centre for Mathematics and Computer Science), Amsterdam, The Netherlands, 2001
|
| |
30
|
Jayavel Shanmugasundaram , Kristin Tufte , Chun Zhang , Gang He , David J. DeWitt , Jeffrey F. Naughton, Relational Databases for Querying XML Documents: Limitations and Opportunities, Proceedings of the 25th International Conference on Very Large Data Bases, p.302-314, September 07-10, 1999
|
| |
31
|
|
| |
32
|
Tian, F., DeWitt, D. J., Chen, J., and Zhang, C. 2000. The design and performance evaluation of altervative XML storage strategies. Tech. rep., Computer Science Department, University of Wisconsin, Madison, WI.
|
| |
33
|
W3C. Document object model (DOM). http://www.w3.org/DOM/.
|
| |
34
|
|
 |
35
|
|
 |
36
|
Chun Zhang , Jeffrey Naughton , David DeWitt , Qiong Luo , Guy Lohman, On supporting containment queries in relational database management systems, Proceedings of the 2001 ACM SIGMOD international conference on Management of data, p.425-436, May 21-24, 2001, Santa Barbara, California, United States
|
| |
37
|
Aoying Zhou , Hongjun Lu , Shihui Zheng , Yuqi Liang , Long Zhang , Wenyun Ji , Zengping Tian, VXMLR: A Visual XML-Relational Database System, Proceedings of the 27th International Conference on Very Large Data Bases, p.719-720, September 11-14, 2001
|
CITED BY 3
|
|
Fengjun Li , Bo Luo , Peng Liu , Dongwon Lee , Chao-Hsien Chu, Automaton segmentation: a new approach to preserve privacy in xml information brokering, Proceedings of the 14th ACM conference on Computer and communications security, October 28-31, 2007, Alexandria, Virginia, USA
|
|
|
Seung Min Kim , Suk I. Yoo , Eunji Hong , Tae Gwon Kim , Il Kon Kim, A document object modeling method to retrieve data from a very large XML document, Proceedings of the 2007 ACM symposium on Document engineering, August 28-31, 2007, Winnipeg, Manitoba, Canada
|
|
|
|
|