Automated statistics collection in action
Source: International Conference on Management of Data
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Baltimore, Maryland
SESSION: Demonstrations: Group 3
Pages: 933-935
Year of Publication: 2005
ISBN: 1-59593-060-4
Authors
P. Haas  IBM Almaden Research Center, San Jose, CA
M. Kandil  IBM Toronto Development Laboratory, ON, Canada
A. Lerner  IBM Almaden Research Center, San Jose, CA
V. Markl  IBM Almaden Research Center, San Jose, CA
I. Popivanov  IBM Toronto Development Laboratory, Markham, ON, Canada
V. Raman  IBM Almaden Research Center, San Jose, CA
D. Zilio  IBM Toronto Development Laboratory, Markham, ON, Canada
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/1066157.1066293

ABSTRACT

If presented with inaccurate statistics, even the most sophisticated query optimizers make mistakes. They may wrongly estimate the output cardinality of an operation and thus make sub-optimal plan choices based on that cardinality. Maintaining accurate statistics is hard, both because each table may need a specifically parameterized set of statistics and because statistics become outdated as the database changes. Automated Statistics Collection (ASC) is a new component in IBM DB2 UDB that, without any DBA intervention, observes and analyzes the effects of faulty statistics and, in response, triggers actions that continuously repair them. In this demonstration, we will show how ASC relieves the DBA of the task of maintaining fresh, accurate statistics in several challenging scenarios. ASC is able to reconfigure the statistics collection parameters (e.g., the number of frequent values for a column, or correlations between certain column pairs) on a per-table basis. ASC can also detect and guard against outdated statistics caused by high update/insert/delete rates in volatile, dynamic databases. We will also show how ASC works from the inside: from how cardinality mis-estimations are introduced in different kinds of operators, to how these errors propagate to later operations in the plan, to how they influence plan choices inside the optimizer.
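
As a rough illustration of the kind of mis-estimation the abstract describes, the following Python sketch shows how treating two correlated columns as independent underestimates a cardinality, and how that error carries into a later join. It is not ASC's mechanism or DB2's estimator; the toy table, the column values, and the fixed join fan-out are all assumptions made up for this example.

```python
from collections import Counter

# Hypothetical toy table of 1000 cars as (make, model) pairs.
# The columns are correlated: the model fully determines the make.
ROWS = (
    [("Honda", "Civic")] * 250
    + [("Honda", "Accord")] * 250
    + [("Toyota", "Corolla")] * 250
    + [("Toyota", "Camry")] * 250
)


def selectivity(rows, column, value):
    """Fraction of rows whose `column` equals `value` (a single-column statistic)."""
    counts = Counter(row[column] for row in rows)
    return counts[value] / len(rows)


def estimate_independent(rows, make, model):
    """Estimate the rows matching both predicates, assuming column independence."""
    return len(rows) * selectivity(rows, 0, make) * selectivity(rows, 1, model)


def actual(rows, make, model):
    """True number of rows matching both predicates."""
    return sum(1 for row in rows if row == (make, model))


# Assumption for illustration: each car joins with exactly three other rows.
FANOUT = 3

est_scan = estimate_independent(ROWS, "Honda", "Civic")
act_scan = actual(ROWS, "Honda", "Civic")

# The join's estimate is built on the scan's estimate, so the error carries over.
est_join = est_scan * FANOUT
act_join = act_scan * FANOUT

print(f"scan: estimated {est_scan:.0f}, actual {act_scan}")
print(f"join: estimated {est_join:.0f}, actual {act_join}")
```

In this toy setup the scan is underestimated by a factor of two (125 versus 250 rows), and the join inherits that factor while the absolute gap triples (375 versus 750 rows). A statistic on the column pair, one of the parameters the abstract says ASC can reconfigure, would remove the error at its source.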

