ACM Home Page
Please provide us with feedback. Feedback
Semantic query optimization for processing XML streams with minimized memory footprint
Full text PdfPdf (268 KB)
Source ACM International Conference Proceeding Series; Vol. 353 archive
Proceedings of the 2008 EDBT workshop on Database technologies for handling XML information on the web table of contents
Nantes, France
SESSION: Query processing table of contents
Pages 27-36  
Year of Publication: 2008
ISBN:978-1-59593-966-1
Authors
Ming Li  Worcester Polytechnic Institute, Worcester, Massachusetts
Murali Mani  Worcester Polytechnic Institute, Worcester, Massachusetts
Elke A. Rundensteiner  Worcester Polytechnic Institute, Worcester, Massachusetts
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 29,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1416691.1416698
What is a DOI?

ABSTRACT

XQuery evaluation over XML streams requires the temporary buffering of XML elements. This paper presents a semantic query optimization solution to minimize memory footprint during XQuery evaluation by exploiting schema knowledge. We focus on one particular class of constraints, namely, the Pattern Non-Occurrence (PNO) constraints for XML streams conforming to pre-defined DTDs. PNO constraints facilitate the early release of buffered data (early buffer release) or possibly avoid to ever store the data (buffer avoidance), thus achieving a minimized memory footprint. We develop an automaton-based technique to detect PNO constraints at runtime. For a given query, optimization opportunities of early buffer release and buffer avoidance which can be triggered by runtime PNO detection are explored and the optimization decision is then encoded into the Raindrop algebraic plan. We implement our optimization technique within the Raindrop XQuery engine. Our experimental studies illustrate that the proposed techniques bring significant performance improvement in both memory and CPU usage with little overhead.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. 1995.
2
 
3
 
4
T. Bray, J. Paoli, C. M. Sperberg-McQueen, E. Maler, and F. Yergeau. Extensible markup language 1.0 (fourth edition). In http://www.w3.org/TR/REC-xml/, 2006.
 
5
 
6
N. Bruno, L. Gravano, N. Koudas, and D. Srivastava. Navigation- vs. index-based xml multi-query processing. In ICDE, pages 139--150, 2003.
 
7
 
8
 
9
P. S. Database. http://pir.georgetown.edu.
 
10
11
 
12
 
13
 
14
 
15
 
16
17
 
18
 
19
M. Schmidt, S. Scherzinger, and C. Koch. Combined static and dynamic analysis for effective buffer minimization. In ICDE, pages 236--245, 2007.
 
20
H. Su, E. A. Rundensteiner, and M. Mani. Semantic query optimization for XQuery over xml streams. In VLDB, pages 1293--1296, 2005.
 
21
 
22
W3C. XQuery 1.0 and Xpath 2.0 formal semantics. http://www.w3.org/TR/query-semantics, 2004.
 
23


Collaborative Colleagues:
Ming Li: colleagues
Murali Mani: colleagues
Elke A. Rundensteiner: colleagues