| Parallelization of XPath queries using multi-core processors: challenges and experiences |
| Full text |
Pdf
(539 KB)
|
| Source
|
Extending Database Technology; Vol. 360
archive
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
table of contents
Saint Petersburg, Russia
SESSION: Research sessions: XML, XPath, XQuery
table of contents
Pages 180-191
Year of Publication: 2009
ISBN:978-1-60558-422-5
|
|
Authors
|
|
Rajesh Bordawekar
|
IBM T. J. Watson Research Center, Hawthorne, NY
|
|
Lipyeow Lim
|
IBM T. J. Watson Research Center, Hawthorne, NY
|
|
Oded Shmueli
|
IBM T. J. Watson Research Center, Hawthorne, NY and Technion, Haifa, Israel
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 14, Downloads (12 Months): 95, Citation Count: 0
|
|
|
ABSTRACT
In this study, we present experiences of parallelizing XPath queries using the Xalan XPath engine on shared-address space multi-core systems. For our evaluation, we consider a scenario where an XPath processor uses multiple threads to concurrently navigate and execute individual XPath queries on a shared XML document. Given the constraints of the XML execution and data models, we propose three strategies for parallelizing individual XPath queries: Data partitioning, Query partitioning, and Hybrid (query and data) partitioning. We experimentally evaluated these strategies on an x86 Linux multi-core system using a set of XPath queries, invoked on a variety of XML documents using the Xalan XPath APIs. Experimental results demonstrate that the proposed parallelization strategies work very effectively in practice; for a majority of XPath queries under evaluation, the execution performance scaled linearly as the number of threads was increased. Results also revealed the pros and cons of the different parallelization strategies for different XPath query patterns.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
DBLP XML Dataset. http://dblp.uni-trier.de/xml.
|
| |
2
|
SQL/XML Standard. www.sqlx.org.
|
| |
3
|
The PENN Treebank Project. http://www.cis.upenn.edu/treebank.
|
| |
4
|
Web Services Activities at W3C. www.w3.org/2002/ws.
|
| |
5
|
XML Schema Activities at W3C. http://www.w3.org/XML/Schema.
|
| |
6
|
|
 |
7
|
|
| |
8
|
World Wide Web Consortium (W3C), XML Path Language (XPath) 2.0, W3C Recommendation, 23 January 2007. www.w3.org.
|
| |
9
|
World Wide Web Consortium (W3C), XQuery 1.0: An XML Query Language, W3C Recommendation, 23 January 2007. www.w3.org.
|
| |
10
|
World Wide Web Consortium (W3C), XQuery 1.0 and XPath 2.0 Data Model (XDM), W3C Recommendation, 23 January 2007. www.w3.org.
|
| |
11
|
World Wide Web Consortium (W3C), XSL Transformations (XSLT) 2.0, W3C Recommendation, 23 January 2007. www.w3.org.
|
 |
12
|
|
| |
13
|
Herlihy, M., and Shavit, N. The Art of Multiprocessor Programming. Morgan Kaufmann, 2008.
|
| |
14
|
|
 |
15
|
|
| |
16
|
|
| |
17
|
|
 |
18
|
Wei Lu , Dennis Gannon, Parallel XML processing by work stealing, Proceedings of the 2007 workshop on Service-oriented computing performance: aspects, issues, and approaches, p.31-38, June 25-25, 2007, Monterey, California, USA
[doi> 10.1145/1272457.1272462]
|
| |
19
|
|
 |
20
|
Albrecht Schmidt , Florian Waas , Martin Kersten , Daniela Florescu , Michael J. Carey , Ioana Manolescu , Ralph Busse, Why and how to benchmark XML databases, ACM SIGMOD Record, v.30 n.3, September 2001
[doi> 10.1145/603867.603872]
|
 |
21
|
|
|