ACM Home Page
Please provide us with feedback. Feedback
Parallelization of XPath queries using multi-core processors: challenges and experiences
Full text PdfPdf (539 KB)
Source Extending Database Technology; Vol. 360 archive
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology table of contents
Saint Petersburg, Russia
SESSION: Research sessions: XML, XPath, XQuery table of contents
Pages 180-191  
Year of Publication: 2009
ISBN:978-1-60558-422-5
Authors
Rajesh Bordawekar  IBM T. J. Watson Research Center, Hawthorne, NY
Lipyeow Lim  IBM T. J. Watson Research Center, Hawthorne, NY
Oded Shmueli  IBM T. J. Watson Research Center, Hawthorne, NY and Technion, Haifa, Israel
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 95,   Citation Count: 0
Additional Information:

abstract   references   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1516360.1516382
What is a DOI?

ABSTRACT

In this study, we present experiences of parallelizing XPath queries using the Xalan XPath engine on shared-address space multi-core systems. For our evaluation, we consider a scenario where an XPath processor uses multiple threads to concurrently navigate and execute individual XPath queries on a shared XML document. Given the constraints of the XML execution and data models, we propose three strategies for parallelizing individual XPath queries: Data partitioning, Query partitioning, and Hybrid (query and data) partitioning. We experimentally evaluated these strategies on an x86 Linux multi-core system using a set of XPath queries, invoked on a variety of XML documents using the Xalan XPath APIs. Experimental results demonstrate that the proposed parallelization strategies work very effectively in practice; for a majority of XPath queries under evaluation, the execution performance scaled linearly as the number of threads was increased. Results also revealed the pros and cons of the different parallelization strategies for different XPath query patterns.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
DBLP XML Dataset. http://dblp.uni-trier.de/xml.
 
2
SQL/XML Standard. www.sqlx.org.
 
3
The PENN Treebank Project. http://www.cis.upenn.edu/treebank.
 
4
Web Services Activities at W3C. www.w3.org/2002/ws.
 
5
XML Schema Activities at W3C. http://www.w3.org/XML/Schema.
 
6
7
 
8
World Wide Web Consortium (W3C), XML Path Language (XPath) 2.0, W3C Recommendation, 23 January 2007. www.w3.org.
 
9
World Wide Web Consortium (W3C), XQuery 1.0: An XML Query Language, W3C Recommendation, 23 January 2007. www.w3.org.
 
10
World Wide Web Consortium (W3C), XQuery 1.0 and XPath 2.0 Data Model (XDM), W3C Recommendation, 23 January 2007. www.w3.org.
 
11
World Wide Web Consortium (W3C), XSL Transformations (XSLT) 2.0, W3C Recommendation, 23 January 2007. www.w3.org.
12
 
13
Herlihy, M., and Shavit, N. The Art of Multiprocessor Programming. Morgan Kaufmann, 2008.
 
14
15
 
16
 
17
18
 
19
20
21
Collaborative Colleagues:
Rajesh Bordawekar: colleagues
Lipyeow Lim: colleagues
Oded Shmueli: colleagues