ACM Home Page
Please provide us with feedback. Feedback
Rate-based query optimization for streaming information sources
Full text PdfPdf (1.11 MB)
Source International Conference on Management of Data archive
Proceedings of the 2002 ACM SIGMOD international conference on Management of data table of contents
Madison, Wisconsin
SESSION: Research sessions: continuous queries and streams table of contents
Pages: 37 - 48  
Year of Publication: 2002
ISBN:1-58113-497-5
Authors
Stratis D. Viglas  University of Wisconsin-Madison, Madison, WI
Jeffrey F. Naughton  University of Wisconsin-Madison, Madison, WI
Sponsor
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 70,   Citation Count: 46
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/564691.564697
What is a DOI?

ABSTRACT

Relational query optimizers have traditionally relied upon table cardinalities when estimating the cost of the query plans they consider. While this approach has been and continues to be successful, the advent of the Internet and the need to execute queries over streaming sources requires a different approach, since for streaming inputs the cardinality may not be known or may not even be knowable (as is the case for an unbounded stream.) In view of this, we propose shifting from a cardinality-based approach to a rate-based approach, and give an optimization framework that aims at maximizing the output rate of query evaluation plans. This approach can be applied to cases where the cardinality-based approach cannot be used. It may also be useful for cases where cardinalities are known, because by focusing on rates we are able not only to optimize the time at which the last result tuple appears, but also to optimize for the number of answers computed at any specified time after the query evaluation commences. We present a preliminary validation of our rate-based optimization framework on a prototype XML query engine, though it is generic enough to be used in other database contexts. The results show that rate-based optimization is feasible and can indeed yield correct decisions.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Arasu, B. Babcock, S. Babu, J. McAlister and J. Widom, Characterizing Memory Requirements for Queries over Continuous Data Streams, Stanford Techinical Report, November 2001, http://dbpubs.stanford.edu/pub/2001-49.
2
 
3
4
5
 
6
 
7
8
 
9
10
 
11
Z. G. Ives, A. Y. Levy and D. S. Weld. Efficient Evaluation of Regular Path Expressions on Streaming XML Data, University of Washington, Technical Report UW-CSE-2000-05-02.
12
 
13
 
14
J. Naughton, D. J. DeWitt, D. Maier et al. The Niagara Internet Query System, IEEE Data Engineering Bulletin 24 (2): 27-33 (2001), http://www.cs.wisc.edu/niagara.
 
15
16
 
17
 
18
G. Schumacher. GEI's Experience with Britton-Lee's IDM, IWDM, 1983, pp. 233-241.
19
 
20
 
21
J. Shanmugasundaram, K. Tufte, D. J. DeWitt, J. F. Naughton and D. Maier. Architecting a Network Query Engine for Producing Partial Results, WebDB 2000.
 
22
T. Urhan and M. J. Franklin. Xjoin: A Reactively-Scheduled Pipelined Join Operator, IEEE Data Engineering Bulletin, June 2000, (23) 2:27-33.
23
 
24
A. N. Wilschut and P. M. G. Apers. Pipelining in Query Execution, Conference on Databases, Parallel Architectures and their Applications, Miami, 1991.
25

CITED BY  46

Collaborative Colleagues:
Stratis D. Viglas: colleagues
Jeffrey F. Naughton: colleagues