|
ABSTRACT
The important challenge of evaluating XPath queries over XML streams has sparked much interest in the past two years, A number of algorithms have been proposed, supporting wider fragments of the query language, and exhibiting better performance and memory utilization. Nevertheless, all the algorithms known to date use a prohibitively large amount of memory for certain types of queries. A natural question then is whether this memory bottleneck is inherent or just an artifact of the proposed algorithms.In this paper we initiate the first systematic and theoretical study of lower bounds on the amount of memory required to evaluate XPath queries over XML streams. We present a general lower bound technique, which given a query, specifies the minimum amount of memory that any algorithm evaluating the query on a stream would need to incur. The lower bounds are stated in terms of new graph-theoretic properties of queries. The proof is based on tools from communication complexity.We then exploit insights learned from the lower bounds to obtain a new algorithm for XPath evaluation on streams. The algorithm uses space close to the optimum. Our algorithm deviates from the standard paradigm of using automata or transducers, thereby avoiding the need to store large transition tables.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
Arvind Arasu , Brian Babcock , Shivnath Babu , Jon McAlister , Jennifer Widom, Characterizing memory requirements for queries over continuous data streams, Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, June 03-05, 2002, Madison, Wisconsin
[doi> 10.1145/543613.543642]
|
| |
3
|
I. Avila-Campillo, T. J. Green, A. Gupta, M. Onizuka, D. Raven, and D. Suciu. XMLTK: An XML toolkit for scalable XML stream processing. In Proc. 1st Workshop on Programming Languages for XML (PLAN-X), 2002.
|
| |
4
|
S. Boag, D. Chamberlin, M. F. Fernández, D. Florescu, J. Robie, and J. Siméosn. XQuery 1.0: An XML Query Language. W3C, http://www.w3.org/TR/xquery, 2003.
|
| |
5
|
T. Bray, J. Paoli, and C. M. Sperbeg-McQueen. Extensible Markup Language (XML) 1.0. W3C, http://www.w3.org/TR/1998/REC-xml-19980210, 1998.
|
| |
6
|
|
| |
7
|
B. Choi, M. Mahoui, and D. Wood. On the optimality of holistic algorithms for twig queries. In Proc. 14th DEXA, pages 28--37, 2003.
|
| |
8
|
B. Choi, M. Mahoui, and D. Wood. The optimality of holistic algorithms for XPath. http://www.cis.upenn.edu/~kkchoi/xpath.pdf, 2003.
|
| |
9
|
J. Clark. XSL Transformations (XSLT) Version 1.0. W3C, http://www.w3.org/TR/xslt, 1999.
|
| |
10
|
J. Clark and S. DeRose. XML Path Language (XPath), Version 1.0. W3C, http://www.w3.org/TR/xpath, 1999.
|
| |
11
|
|
| |
12
|
|
 |
13
|
|
| |
14
|
|
 |
15
|
|
| |
16
|
Z. Ives, A. Levy, and D. Weld. Efficient evaluation of regular path expressions on streaming XML data. Technical report, University of Washington, 2000.
|
| |
17
|
|
| |
18
|
|
| |
19
|
D. Olteanu, T. Kiesling, and F. Bry. An evaluation of regular path expressions with qualifiers against XML streams. In Proc. 19th ICDE, pages 702--704, 2003.
|
 |
20
|
|
 |
21
|
|
 |
22
|
|
 |
23
|
|
CITED BY 17
|
|
|
|
|
|
|
|
|
|
|
K. Selçuk Candan , Wang-Pin Hsiung , Songting Chen , Junichi Tatemura , Divyakant Agrawal, AFilter: adaptable XML filtering with prefix-caching suffix-clustering, Proceedings of the 32nd international conference on Very large data bases, September 12-15, 2006, Seoul, Korea
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Chi Yang , Chengfei Liu , Jianxin Li , Jeffrey Xu Yu , Junhu Wang, Semantics based buffer reduction for queries over XML data streams, Proceedings of the nineteenth conference on Australasian database, December 03-04, 2007, Gold Coast, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|