ACM Home Page
Please provide us with feedback. Feedback
Exploiting statistics on query expressions for optimization
Full text PdfPdf (1.33 MB)
Source International Conference on Management of Data archive
Proceedings of the 2002 ACM SIGMOD international conference on Management of data table of contents
Madison, Wisconsin
SESSION: Research sessions: query processing I table of contents
Pages: 263 - 274  
Year of Publication: 2002
ISBN:1-58113-497-5
Authors
Nicolas Bruno  Columbia University
Surajit Chaudhuri  Microsoft Research
Sponsor
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 80,   Citation Count: 37
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/564691.564722
What is a DOI?

ABSTRACT

Statistics play an important role in influencing the plans produced by a query optimizer. Traditionally, optimizers use statistics built over base tables and assume independence between attributes while propagating statistical information through the query plan. This approach can introduce large estimation errors, which may result in the optimizer choosing inefficient execution plans. In this paper, we show how to extend a generic optimizer so that it also exploits statistics built on expressions corresponding to intermediate nodes of query plans. We show that in some cases, the quality of the resulting plans is significantly better than when only base-table statistics are available. Unfortunately, even moderately-sized schemas may have too many relevant candidate statistics. We introduce a workload-driven technique to identify a small subset of statistics that can provide significant benefits over just maintaining base-table statistics. Finally, we present experimental results on an implementation of our approach in Microsoft SQL Server 2000.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
4
 
5
 
6
 
7
8
9
 
10
G. Graefe. The cascades framework for query optimization. Data Engineering Bulletin, 18(3), 1995.
11
 
12
13
14
15
16
 
17
18
 
19
20
 
21
 
22
TPC Benchmark H. Decision support. Available at http://www.tpc.org.

CITED BY  37

Collaborative Colleagues:
Nicolas Bruno: colleagues
Surajit Chaudhuri: colleagues