ACM Home Page
Please provide us with feedback. Feedback
Scalable approximate query processing with the DBO engine
Full text PdfPdf (369 KB)
Source
International Conference on Management of Data archive
Proceedings of the 2007 ACM SIGMOD international conference on Management of data table of contents
Beijing, China
SESSION: Approximate and probabilistic processing table of contents
Pages: 725 - 736  
Year of Publication: 2007
ISBN:978-1-59593-686-8
Authors
Christopher Jermaine  University of Florida, Gainesville, FL
Subramanian Arumugam  University of Florida, Gainesville, FL
Abhijit Pol  University of Florida, Gainesville, FL
Alin Dobra  University of Florida, Gainesville, FL
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 107,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1247480.1247560
What is a DOI?

ABSTRACT

This paper describes query processing in the DBO database system. Like other database systems designed for ad-hoc, analytic processing, DBO is able to compute the exact answer to queries over a large relational database in a scalable fashion. Unlike any other system designed for analytic processing, DBO can constantly maintain a guess as to the final answer to an aggregate query throughout execution, along with statistically meaningful bounds for the guess's accuracy. As DBO gathers more and more information, the guess gets more and more accurate, until it is 100% accurate as the query is completed. This allows users to stop the execution at any time that they are happy with the query accuracy, and encourages exploratory data analysis.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
S. Acharya, P. Gibons, V. Poosala, S. Ramaswamy: Join Synopses for Approximate Query Processing. SIGMOD 1999:275--286.
 
2
S. Chaudhuri, R. Motwani, V. R. Narasayya: On Random Sampling over Joins. SIGMOD 1999: 263--274.
 
3
W. Cochran: Sampling Techniques. Wiley and Sons, 1977.
4
 
5
J. P. Dittrich, B. Seeger, D. S. Taylor, P. Widmayer: Progressive Merge Join: A Generic and Non-blocking Sort-based Join Algorithm. VLDB 2002: 299--310.
 
6
P. J. Haas, J. M. Hellerstein: Ripple Joins for Online Aggregation. SIGMOD 1999: 287--298.
 
7
 
8
 
9
G. H. Hardy, J. E. Littlewood, and G. Polya. Inequalities. Cambridge University Press, 1988.
 
10
 
11
J. M. Hellerstein, P. J. Haas, H. J. Wang: Online Aggregation. SIGMOD 1997: 171--182.
 
12
 
13
C. Jermaine, A. Dobra, S. Arumugam, S. Joshi, A. Pol: A Disk-Based Join with Probabilistic Guarantees. SIGMOD 2005: 456--467.
 
14
15
 
16
F. Olken: Random Sampling from Databases. PhD Thesis, U. of California, Berkeley, 1993
 
17
F. Olken, D. Rotem, P. Xu: Random Sampling from Hash Files. SIGMOD 1990: 375--386.
 
18
19
 
20
J. Shao: Mathematical Statistics. Springer-Verlag, 1999.


Collaborative Colleagues:
Christopher Jermaine: colleagues
Subramanian Arumugam: colleagues
Abhijit Pol: colleagues
Alin Dobra: colleagues