ACM Home Page
Please provide us with feedback. Feedback
STHoles: a multidimensional workload-aware histogram
Full text PdfPdf (429 KB)
Source International Conference on Management of Data archive
Proceedings of the 2001 ACM SIGMOD international conference on Management of data table of contents
Santa Barbara, California, United States
Pages: 211 - 222  
Year of Publication: 2001
ISBN:1-58113-332-4
Also published in ...
Authors
Nicolas Bruno  Columbia University
Surajit Chaudhuri  Microsoft Research
Luis Gravano  Columbia University
Sponsor
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 57,   Citation Count: 68
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/375663.375686
What is a DOI?

ABSTRACT

Attributes of a relation are not typically independent. Multidimensional histograms can be an effective tool for accurate multiattribute query selectivity estimation. In this paper, we introduce STHoles, a “workload-aware” histogram that allows bucket nesting to capture data regions with reasonably uniform tuple density. STHoles histograms are built without examining the data sets, but rather by just analyzing query results. Buckets are allocated where needed the most as indicated by the workload, which leads to accurate query selectivity estimations. Our extensive experiments demonstrate that STHoles histograms consistently produce good selectivity estimates across synthetic and real-world data sets and across query workloads, and, in many cases, outperform the best multidimensional histogram techniques that require access to and processing of the full data sets during histogram construction.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
C. Blake and C. Merz. UCI repository of machine learning databases, 1998.
 
3
N. Bruno, S. Chaudhuri, and L. Gravano. STHoles: A multidimensional workload-aware histogram. Technical Report MSR-TR-2001-36, Microsoft Research, 2001. Accessible at ftp://ftp.research.microsoft.com/pub/- tr/tr-2001-36.pdf.
4
 
5
 
6
7
 
8
Y. Ioannidis. Query optimization. In Handbook for Computer Science. CRC Press, 1997.
9
 
10
 
11
12
13
14
15
 
16
 
17
18
19
 
20
 
21
22
 
23

CITED BY  68

Collaborative Colleagues:
Nicolas Bruno: colleagues
Surajit Chaudhuri: colleagues
Luis Gravano: colleagues