ACM Home Page
Please provide us with feedback. Feedback
Raindrop: a uniform and layered algebraic framework for XQueries on XML streams
Full text PdfPdf (706 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the twelfth international conference on Information and knowledge management table of contents
New Orleans, LA, USA
SESSION: Database session 5: management of data streams table of contents
Pages: 279 - 286  
Year of Publication: 2003
ISBN:1-58113-723-0
Authors
Hong Su  Worcester Polytechnic Institute
Jinhui Jian  Worcester Polytechnic Institute
Elke A. Rundensteiner  Worcester Polytechnic Institute
Sponsors
ACM: Association for Computing Machinery
SIGMIS: ACM Special Interest Group on Management Information Systems
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 21,   Citation Count: 8
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/956863.956917
What is a DOI?

ABSTRACT

XML stream applications bring the challenge of efficiently processing queries on sequentially accessible token-based data. While the automata model is naturally suited for pattern matching on tokenized XML streams, the algebraic model in contrast is a well-established technique for set-oriented processing of self-contained tuples. However, neither automata nor algebraic models are well-equipped to handle both computation paradigms.

The goal of the Raindrop project is to accommodate these two paradigms within one algebraic framework to take advantage of both. In our query model, both tokenized data and self-contained tuples are supported in a uniform manner. Query plans can be flexibly rewritten using equivalence rules to change what computation is done using tokenized data versus tuples. This paper highlights the four abstraction levels in Raindrop, namely, semantics-focused plan, stream logical plan, stream physical plan and execution plan. Various optimization techniques are provided at each level. The necessity of such a uniform and layered plan is shown by experimental study


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
A. Schmidt, F. Waas and R. Busse et al. Xmark -- The XML-Benchmark Project. http://www.xml-benchmark.org, 2001.
2
 
3
D. Barbosa, A. Mendelzon, and J. Keenleyside et al. ToXgene: a Template-Based Data Generator for XML. In Proceedings of WEBDB, pages 49--54, 2002.
 
4
D. Carney, U. Cetintemel, and M. Cherniack et al. Monitoring Streams - A New Class of Data Management Applications. In Proceedings of VLDB, pages 215--226, 2002.
 
5
Y. Diao and M. Franklin. Query Processing for High-Volume XML Message Brokering. In Proceeding of VLDB, to appear, 2003.
6
 
7
 
8
 
9
J. Jian, H. Su, and E. Rundensteiner. Automaton Meets Query Algebra: Towards A Unified Model for XQuery Evaluation over XML Data Streams. In Proceedings of ER, to appear, 2003.
 
10
B. Ludascher, P. Mukhopadhyay, and Y. Papakonstantinou. A Transducer- Based XML Query Processor. In Proceedings of VLDB, pages 215--226, 2002.
 
11
 
12
13
 
14
Sourceforge. SAX 2.0. http://www.saxproject.org/, 2002.
15

CITED BY  8

Collaborative Colleagues:
Hong Su: colleagues
Jinhui Jian: colleagues
Elke A. Rundensteiner: colleagues