ACM Home Page
Please provide us with feedback. Feedback
Fast incremental maintenance of approximate histograms
Full text PdfPdf (473 KB)
Source ACM Transactions on Database Systems (TODS) archive
Volume 27 ,  Issue 3  (September 2002) table of contents
Pages: 261 - 298  
Year of Publication: 2002
ISSN:0362-5915
Authors
Phillip B. Gibbons  Intel Research Pittsburgh, Pittsburgh, PA
Yossi Matias  Tel Aviv University, Tel Aviv, Israel
Viswanath Poosala  Bell Laboratories, Murray Hill, NJ
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 94,   Citation Count: 17
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/581751.581753
What is a DOI?

ABSTRACT

Many commercial database systems maintain histograms to summarize the contents of large relations and permit efficient estimation of query result sizes for use in query optimizers. Delaying the propagation of database updates to the histogram often introduces errors into the estimation. This article presents new sampling-based approaches for incremental maintenance of approximate histograms. By scheduling updates to the histogram based on the updates to the database, our techniques are the first to maintain histograms effectively up to date at all times and avoid computing overheads when unnecessary. Our techniques provide highly accurate approximate histograms belonging to the equidepth and Compressed classes. Experimental results show that our new approaches provide orders of magnitude more accurate estimation than previous approaches.An important aspect employed by these new approaches is a backing sample, an up-to-date random sample of the tuples currently in a relation. We provide efficient solutions for maintaining a uniformly random sample of a relation in the presence of updates to the relation. The backing sample techniques can be used for any other application that relies on random samples of data.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
4
5
6
7
8
9
 
10
 
11
12
13
 
14
Gilbert, A. C., Kotidis, Y., Muthukrishnan, S., and Strauss, M. J. 2002b. How to summarize the universe: Dynamic maintenance of quantiles. In Proceedings of the 28th International Conference on Very Large Data Bases, Morgan Kaufman, Hong Kong.
15
16
17
18
 
19
 
20
 
21
 
22
23
24
 
25
 
26
 
27
 
28
29
30
31
32
 
33
Zipf, G. K. 1949. Human Behaviour and the Principle of Least Effort. Addison-Wesley, Reading, Mass.

CITED BY  17

Collaborative Colleagues:
Phillip B. Gibbons: colleagues
Yossi Matias: colleagues
Viswanath Poosala: colleagues