ACM Home Page
Please provide us with feedback. Feedback
Adaptive rank-aware query optimization in relational databases
Full text PdfPdf (1.48 MB)
Source ACM Transactions on Database Systems (TODS) archive
Volume 31 ,  Issue 4  (December 2006) table of contents
Pages: 1257 - 1304  
Year of Publication: 2006
ISSN:0362-5915
Authors
Ihab F. Ilyas  University of Waterloo, Waterloo, Ontario, Canada
Walid G. Aref  Purdue University, West Lafayette, IN
Ahmed K. Elmagarmid  Purdue University, West Lafayette, IN
Hicham G. Elmongui  Purdue University, West Lafayette, IN
Rahul Shah  Purdue University, West Lafayette, IN
Jeffrey Scott Vitter  Purdue University, West Lafayette, IN
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 15,   Downloads (12 Months): 144,   Citation Count: 6
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1189769.1189772
What is a DOI?

ABSTRACT

Rank-aware query processing has emerged as a key requirement in modern applications. In these applications, efficient and adaptive evaluation of top-k queries is an integral part of the application semantics. In this article, we introduce a rank-aware query optimization framework that fully integrates rank-join operators into relational query engines. The framework is based on extending the System R dynamic programming algorithm in both enumeration and pruning. We define ranking as an interesting physical property that triggers the generation of rank-aware query plans. Unlike traditional join operators, optimizing for rank-join operators depends on estimating the input cardinality of these operators. We introduce a probabilistic model for estimating the input cardinality, and hence the cost of a rank-join operator. To our knowledge, this is the first effort in estimating the needed input size for optimal rank aggregation algorithms. Costing ranking plans is key to the full integration of rank-join operators in real-world query processing engines.Since optimal execution strategies picked by static query optimizers lose their optimality due to estimation errors and unexpected changes in the computing environment, we introduce several adaptive execution strategies for top-k queries that respond to these unexpected changes and costing errors. Our reactive reoptimization techniques change the execution plan at runtime to significantly enhance the performance of running queries. Since top-k query plans are usually pipelined and maintain a complex ranking state, altering the execution strategy of a running ranking query is an important and challenging task.We conduct an extensive experimental study to evaluate the performance of the proposed framework. The experimental results are twofold: (1) we show the effectiveness of our cost-based approach of integrating ranking plans in dynamic programming cost-based optimizers; and (2) we show a significant speedup (up to 300%) when using our adaptive execution of ranking plans over the state-of-the-art mid-query reoptimization strategies.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
3
4
 
5
6
 
7
 
8
9
 
10
Deshpande, A. and Hellerstein, J. M. 2004. Lifting the burden of history from adaptive query processing. In Proceedings of the 30 International Conference on Very Large Data Bases. 948--959.
 
11
12
 
13
14
15
 
16
 
17
 
18
19
 
20
 
21
Hristidis, V., Gravano, L., and Papakonstantinou, Y. 2003. Efficient IR-style keyword search over relational databases. In Proceedings of the 29th International Conference on Very Large Data Bases.
 
22
Ilyas, I. F., Aref, W. G., and Elmagarmid, A. K. 2002. Joining ranked inputs in practice. In Proceedings of the 28th International Conference on Very Large Data Bases. 950--961.
 
23
Ilyas, I. F., Aref, W. G., and Elmagarmid, A. K. 2003. Supporting top-k join queries in relational databases. In Proceedings of the 29th International Conference on Very Large Data Bases. 754--765.
 
24
25
26
27
28
29
 
30
 
31
 
32
Raman, V., Deshpande, A., and Hellerstein, J. M. 2003. Using state modules for adaptive query processing. In Proceedings of the 19th International Conference on Data Engineering. 353--387.
33
 
34
35
 
36
37


Collaborative Colleagues:
Ihab F. Ilyas: colleagues
Walid G. Aref: colleagues
Ahmed K. Elmagarmid: colleagues
Hicham G. Elmongui: colleagues
Rahul Shah: colleagues
Jeffrey Scott Vitter: colleagues