ACM Home Page
Please provide us with feedback. Feedback
Self-tuning histograms: building histograms without looking at data
Full text PdfPdf (1.67 MB)
Source International Conference on Management of Data archive
Proceedings of the 1999 ACM SIGMOD international conference on Management of data table of contents
Philadelphia, Pennsylvania, United States
Pages: 181 - 192  
Year of Publication: 1999
ISBN:1-58113-084-8
Also published in ...
Authors
Ashraf Aboulnaga  Computer Sciences Department, University of Wisconsin, Madison
Surajit Chaudhuri  Microsoft Research
Sponsors
SIGART: ACM Special Interest Group on Artificial Intelligence
SIGMOD: ACM Special Interest Group on Management of Data
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 64,   Citation Count: 62
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/304182.304198
What is a DOI?

ABSTRACT

In this paper, we introduce self-tuning histograms. Although similar in structure to traditional histograms, these histograms infer data distributions not by examining the data or a sample thereof, but by using feedback from the query execution engine about the actual selectivity of range selection operators to progressively refine the histogram. Since the cost of building and maintaining self-tuning histograms is independent of the data size, self-tuning histograms provide a remarkably inexpensive way to construct histograms for large data sets with little up-front costs. Self-tuning histograms are particularly attractive as an alternative to multi-dimensional traditional histograms that capture dependencies between attributes but are prohibitively expensive to build and maintain. In this paper, we describe the techniques for initializing and refining self-tuning histograms. Our experimental results show that self-tuning histograms provide a low-cost alternative to traditional multi-dimensional histograms with little loss of accuracy for data distributions with low to moderate skew.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

CR94
 
GMP97
KD98
 
Koo80
LNS90
MD88
MRL98
MVW98
NHS84
PIHS96
 
PI97
 
Zip49
G.K. Zipf. Human behaviour and the principle of least effort. Addison-Wesley, Reading, MA, 1949.

CITED BY  62

Collaborative Colleagues:
Ashraf Aboulnaga: colleagues
Surajit Chaudhuri: colleagues