ACM Home Page
Please provide us with feedback. Feedback
XPath query evaluation based on the stack encoding
Full text PdfPdf (736 KB)
Source
ACM International Conference Proceeding Series archive
Proceedings of the 2nd Canadian Conference on Computer Science and Software Engineering table of contents
Montreal, Quebec, Canada
SESSION: Databases 2 (full papers) table of contents
Pages 43-57  
Year of Publication: 2009
ISBN:978-1-60558-401-0
Authors
Yangjun Chen  University of Winnipeg, Winnipeg, Manitoba, Canada
Donovan Cooke  University of Winnipeg, Winnipeg, Manitoba, Canada
Sponsors
ACM : Assoc. for Computing Machinery
: BytePress
Concordia University : Concordia University
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 35,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1557626.1557634
What is a DOI?

ABSTRACT

The twig join, which is used to find all occurrences of a twig pattern in an XML database, is a core operation for XML query processing. A great many strategies for handling this problem have been proposed and can be roughly classified into two groups. The first group decomposes a twig pattern (a small tree) into a set of binary relationships between pairs of nodes, such as parent-child and ancestor-descendant relations; and transforms a tree matching problem into a series of simple relation look-ups. The second group decomposes a twig pattern into a set of paths. Among all this kind of methods, the approach based on the so-called stack encoding by Bruno et. al. [2] is very interesting, which can represent in linear space a potentially exponential (in the number of query nodes) number of matching paths. However, the available processes for generating such compressed paths suffer some redundancy and can be significantly improved. In this paper, we analyze this method and show that the time complexities of path generation in its two main procedures: PathStack and TwigStack can be reduced from O(m2 ·n) to O(m ·n), where m and n are the sizes of the query tree and document tree, respectively. Experiments have been done to compare ours and some existing startegies, which shows that using our method much less time is needed to generate matching paths.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
D. D. Chamberlin, J. Clark, D. Florescu and M. Stefanescu. "XQuery1.0: An XML Query Language," http://www.w3.org/TR/query-datamodel/.
 
4
 
5
 
6
A. Dutch, M. Fernandes, D. Florescu, A. Levy, D. Suciu. "A Query Language for XML," WWW'99.
 
7
D. Florescu and D. Kossman, Storing and Querying XML data using an RDMBS, IEEE Data Engineering Bulletin, 22(3):27--34, 1999.
 
8
 
9
 
10
U. of Washington, The Tukwila System, available from http://data.cs.washington.edu/integration/tukwila/.
 
11
U. of Wisconsin, The Niagara System, available from http://www.cs.wisc.edu/niagara/.
12
 
13
 
14
World Wide Web Consortium. XML Path Language (XPath), W3C Recommendation, Version 1.0, November 1999. See http://www.w3.org/TR/xpath.
 
15
World Wide Web Consortium. XQuery 1.0: An XML Query Language, W3C Recommendation, Version 1.0, Dec. 2001. See http://www.w3.org/TR/xquery.
16
 
17
U. of Wisconsin, The Niagara System, available from http://www.cs.wisc.edu/niagara/.
 
18
XMARK: The XML-benchmark project, http://monetdb.cwi.nl/xml, 2002.

Collaborative Colleagues:
Yangjun Chen: colleagues
Donovan Cooke: colleagues