| Scalable approximate query processing with the DBO engine |
| Full text |
Pdf
(369 KB)
|
Source
|
International Conference on Management of Data
archive
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
table of contents
Beijing, China
SESSION: Approximate and probabilistic processing
table of contents
Pages: 725 - 736
Year of Publication: 2007
ISBN:978-1-59593-686-8
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 13, Downloads (12 Months): 123, Citation Count: 5
|
|
|
ABSTRACT
This paper describes query processing in the DBO database system. Like other database systems designed for ad-hoc, analytic processing, DBO is able to compute the exact answer to queries over a large relational database in a scalable fashion. Unlike any other system designed for analytic processing, DBO can constantly maintain a guess as to the final answer to an aggregate query throughout execution, along with statistically meaningful bounds for the guess's accuracy. As DBO gathers more and more information, the guess gets more and more accurate, until it is 100% accurate as the query is completed. This allows users to stop the execution at any time that they are happy with the query accuracy, and encourages exploratory data analysis.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
S. Acharya, P. Gibons, V. Poosala, S. Ramaswamy: Join Synopses for Approximate Query Processing. SIGMOD 1999:275--286.
|
| |
2
|
S. Chaudhuri, R. Motwani, V. R. Narasayya: On Random Sampling over Joins. SIGMOD 1999: 263--274.
|
| |
3
|
W. Cochran: Sampling Techniques. Wiley and Sons, 1977.
|
 |
4
|
Jens-Peter Dittrich , Bernhard Seeger , David Scot Taylor , Peter Widmayer, On producing join results early, Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.134-142, June 09-11, 2003, San Diego, California
[doi> 10.1145/773153.773167]
|
| |
5
|
J. P. Dittrich, B. Seeger, D. S. Taylor, P. Widmayer: Progressive Merge Join: A Generic and Non-blocking Sort-based Join Algorithm. VLDB 2002: 299--310.
|
| |
6
|
P. J. Haas, J. M. Hellerstein: Ripple Joins for Online Aggregation. SIGMOD 1999: 287--298.
|
| |
7
|
|
| |
8
|
|
| |
9
|
G. H. Hardy, J. E. Littlewood, and G. Polya. Inequalities. Cambridge University Press, 1988.
|
| |
10
|
Joseph M. Hellerstein , Ron Avnur , Andy Chou , Christian Hidber , Chris Olston , Vijayshankar Raman , Tali Roth , Peter J. Haas, Interactive Data Analysis: The Control Project, Computer, v.32 n.8, p.51-59, August 1999
[doi> 10.1109/2.781635]
|
| |
11
|
J. M. Hellerstein, P. J. Haas, H. J. Wang: Online Aggregation. SIGMOD 1997: 171--182.
|
| |
12
|
|
| |
13
|
C. Jermaine, A. Dobra, S. Arumugam, S. Joshi, A. Pol: A Disk-Based Join with Probabilistic Guarantees. SIGMOD 2005: 456--467.
|
| |
14
|
|
 |
15
|
|
| |
16
|
F. Olken: Random Sampling from Databases. PhD Thesis, U. of California, Berkeley, 1993
|
| |
17
|
F. Olken, D. Rotem, P. Xu: Random Sampling from Hash Files. SIGMOD 1990: 375--386.
|
| |
18
|
|
 |
19
|
|
| |
20
|
J. Shao: Mathematical Statistics. Springer-Verlag, 1999.
|
CITED BY 5
|
|
|
|
|
Florin Rusu , Fei Xu , Luis Leopoldo Perez , Mingxi Wu , Ravi Jampani , Chris Jermaine , Alin Dobra, The DBO database system, Proceedings of the 2008 ACM SIGMOD international conference on Management of data, June 09-12, 2008, Vancouver, Canada
|
|
|
|
|
|
Ravi Jampani , Fei Xu , Mingxi Wu , Luis Leopoldo Perez , Christopher Jermaine , Peter J. Haas, MCDB: a monte carlo approach to managing uncertain data, Proceedings of the 2008 ACM SIGMOD international conference on Management of data, June 09-12, 2008, Vancouver, Canada
|
|
|
|
|