| Semantic query optimization for processing XML streams with minimized memory footprint |
| Full text |
Pdf
(268 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 353
archive
Proceedings of the 2008 EDBT workshop on Database technologies for handling XML information on the web
table of contents
Nantes, France
SESSION: Query processing
table of contents
Pages 27-36
Year of Publication: 2008
ISBN:978-1-59593-966-1
|
|
Authors
|
|
Ming Li
|
Worcester Polytechnic Institute, Worcester, Massachusetts
|
|
Murali Mani
|
Worcester Polytechnic Institute, Worcester, Massachusetts
|
|
Elke A. Rundensteiner
|
Worcester Polytechnic Institute, Worcester, Massachusetts
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 3, Downloads (12 Months): 29, Citation Count: 1
|
|
|
ABSTRACT
XQuery evaluation over XML streams requires the temporary buffering of XML elements. This paper presents a semantic query optimization solution to minimize memory footprint during XQuery evaluation by exploiting schema knowledge. We focus on one particular class of constraints, namely, the Pattern Non-Occurrence (PNO) constraints for XML streams conforming to pre-defined DTDs. PNO constraints facilitate the early release of buffered data (early buffer release) or possibly avoid to ever store the data (buffer avoidance), thus achieving a minimized memory footprint. We develop an automaton-based technique to detect PNO constraints at runtime. For a given query, optimization opportunities of early buffer release and buffer avoidance which can be triggered by runtime PNO detection are explored and the optimization decision is then encoded into the Raindrop algebraic plan. We implement our optimization technique within the Raindrop XQuery engine. Our experimental studies illustrate that the proposed techniques bring significant performance improvement in both memory and CPU usage with little overhead.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. 1995.
|
 |
2
|
|
| |
3
|
|
| |
4
|
T. Bray, J. Paoli, C. M. Sperberg-McQueen, E. Maler, and F. Yergeau. Extensible markup language 1.0 (fourth edition). In http://www.w3.org/TR/REC-xml/, 2006.
|
| |
5
|
|
| |
6
|
N. Bruno, L. Gravano, N. Koudas, and D. Srivastava. Navigation- vs. index-based xml multi-query processing. In ICDE, pages 139--150, 2003.
|
| |
7
|
|
| |
8
|
|
| |
9
|
P. S. Database. http://pir.georgetown.edu.
|
| |
10
|
|
 |
11
|
|
| |
12
|
Christoph Koch , Stefanie Scherzinger , Nicole Schweikardt , Bernhard Stegmaier, Schema-based scheduling of event processors and buffer minimization for queries on structured data streams, Proceedings of the Thirtieth international conference on Very large data bases, p.228-239, August 31-September 03, 2004, Toronto, Canada
|
| |
13
|
|
| |
14
|
|
| |
15
|
|
| |
16
|
|
 |
17
|
|
| |
18
|
Albrecht Schmidt , Florian Waas , Martin Kersten , Michael J. Carey , Ioana Manolescu , Ralph Busse, XMark: a benchmark for XML data management, Proceedings of the 28th international conference on Very Large Data Bases, p.974-985, August 20-23, 2002, Hong Kong, China
|
| |
19
|
M. Schmidt, S. Scherzinger, and C. Koch. Combined static and dynamic analysis for effective buffer minimization. In ICDE, pages 236--245, 2007.
|
| |
20
|
H. Su, E. A. Rundensteiner, and M. Mani. Semantic query optimization for XQuery over xml streams. In VLDB, pages 1293--1296, 2005.
|
| |
21
|
|
| |
22
|
W3C. XQuery 1.0 and Xpath 2.0 formal semantics. http://www.w3.org/TR/query-semantics, 2004.
|
| |
23
|
Song Wang , Hong Su , Ming Li , Mingzhu Wei , Shoushen Yang , Drew Ditto , Elke A. Rundensteiner , Murali Mani, R-SOX: runtime semantic query optimization over XML streams, Proceedings of the 32nd international conference on Very large data bases, September 12-15, 2006, Seoul, Korea
|
|