ACM Home Page
Please provide us with feedback. Feedback
Top-k selection queries over relational databases: Mapping strategies and performance evaluation
Full text PdfPdf (1.64 MB)
Source ACM Transactions on Database Systems (TODS) archive
Volume 27 ,  Issue 2  (June 2002) table of contents
Pages: 153 - 187  
Year of Publication: 2002
ISSN:0362-5915
Authors
Nicolas Bruno  Columbia University, New York, NY
Surajit Chaudhuri  Microsoft Research, Redmond, WA
Luis Gravano  Columbia University, New York, NY
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 28,   Downloads (12 Months): 104,   Citation Count: 43
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/568518.568519
What is a DOI?

ABSTRACT

In many applications, users specify target values for certain attributes, without requiring exact matches to these values in return. Instead, the result to such queries is typically a rank of the "top k" tuples that best match the given attribute values. In this paper, we study the advantages and limitations of processing a top-k query by translating it into a single range query that a traditional relational database management system (RDBMS) can process efficiently. In particular, we study how to determine a range query to evaluate a top-k query by exploiting the statistics available to an RDBMS, and the impact of the quality of these statistics on the retrieval efficiency of the resulting scheme. We also report the first experimental evaluation of the mapping strategies over a real RDBMS, namely over Microsoft's SQL Server 7.0. The experiments show that our new techniques are robust and significantly more efficient than previously known strategies requiring at least one sequential scan of the data sets.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
 
4
 
5
Blake, C. and Merz, C. 1998. UCI repository of machine learning databases.
 
6
Bruno, N., Chaudhuri, S., and Gravano, L. 2000. Performance of multiattribute top-k queries on relational systems. Tech. Rep. CUCS-021-00, Columbia Univ., New York.
7
8
 
9
10
 
11
 
12
 
13
 
14
 
15
16
17
18
19
 
20
21
22
23
 
24
Muralikrishna, M. and DeWitt, D. J. 1988. Equi-depth histograms for estimating selectivity factors for multidimensional queries. In Proceedings of the 1988 ACM International Conference on Management of Data (SIGMOD'88). ACM, New York.
25
 
26
27
28
 
29
30
31
 
32
33
 
34

CITED BY  43

Collaborative Colleagues:
Nicolas Bruno: colleagues
Surajit Chaudhuri: colleagues
Luis Gravano: colleagues