ACM Home Page
Please provide us with feedback. Feedback
Robust query processing through progressive optimization
Full text PdfPdf (331 KB)
Source International Conference on Management of Data archive
Proceedings of the 2004 ACM SIGMOD international conference on Management of data table of contents
Paris, France
SESSION: Research sessions: query optimization table of contents
Pages: 659 - 670  
Year of Publication: 2004
ISBN:1-58113-859-8
Authors
Volker Markl  IBM Almaden Research Center, San Jose, CA
Vijayshankar Raman  IBM Almaden Research Center, San Jose, CA
David Simmen  IBM Silicon Valley Lab, San Jose, CA
Guy Lohman  IBM Almaden Research Center, San Jose, CA
Hamid Pirahesh  IBM Almaden Research Center, San Jose, CA
Miso Cilimdzic  IBM Toronto Lab, Toronto, Canada
Sponsor
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 17,   Downloads (12 Months): 126,   Citation Count: 31
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1007568.1007642
What is a DOI?

ABSTRACT

Virtually every commercial query optimizer chooses the best plan for a query using a cost model that relies heavily on accurate cardinality estimation. Cardinality estimation errors can occur due to the use of inaccurate statistics, invalid assumptions about attribute independence, parameter markers, and so on. Cardinality estimation errors may cause the optimizer to choose a sub-optimal plan. We present an approach to query processing that is extremely robust because it is able to detect and recover from cardinality estimation errors. We call this approach "progressive query optimization" (POP). POP validates cardinality estimates against actual values as measured during query execution. If there is significant disagreement between estimated and actual values, execution might be stopped and re-optimization might occur. Oscillation between optimization and execution steps can occur any number of times. A re-optimization step can exploit both the actual cardinality and partial results, computed during a previous execution step. Checkpoint operators (CHECK) validate the optimizer's cardinality estimates against actual cardinalities. Each CHECK has a condition that indicates the cardinality bounds within which a plan is valid. We compute this validity range through a novel sensitivity analysis of query plan operators. If the CHECK condition is violated, CHECK triggers re-optimization. POP has been prototyped in a leading commercial DBMS. An experimental evaluation of POP using TPC-H queries illustrates the robustness POP adds to query processing, while incurring only negligible overhead. A case-study applying POP to a real-world database and workload shows the potential of POP, accelerating complex OLAP queries by almost two orders of magnitude.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
4
5
6
 
7
8
 
9
HS93 P. Haas and A. Swami, Sampling-Based Selectivity Estimation for Joins Using Augmented Frequent Value Statistics, IBM Research Report, 1993
 
10
HS02 A. Hulgeri and S. Sudarshan. Parametric Query Optimization for Linear and Piecewise Linear Cost Functions. VLDB 2002.
11
 
12
13
14
15
 
16
 
17
RAH03 V. Raman, A. Deshpande, and J. M. Hellerstein, Using State Modules for Adaptive Query Optimization. ICDE 2003
18
 
19
20
21
 
22
23

CITED BY  31
Collaborative Colleagues:
Volker Markl: colleagues
Vijayshankar Raman: colleagues
David Simmen: colleagues
Guy Lohman: colleagues
Hamid Pirahesh: colleagues
Miso Cilimdzic: colleagues