ACM Home Page
Please provide us with feedback. Feedback
Multi-dimensional selectivity estimation using compressed histogram information
Full text PdfPdf (1.18 MB)
Source International Conference on Management of Data archive
Proceedings of the 1999 ACM SIGMOD international conference on Management of data table of contents
Philadelphia, Pennsylvania, United States
Pages: 205 - 214  
Year of Publication: 1999
ISBN:1-58113-084-8
Also published in ...
Authors
Ju-Hong Lee  Department of Information and Communication Engineering, Korea Advanced Institute of Science and Technology
Deok-Hwan Kim  Department of Information and Communication Engineering, Korea Advanced Institute of Science and Technology
Chin-Wan Chung  Department of Computer Science, Korea Advanced Institute of Science and Technology
Sponsors
SIGART: ACM Special Interest Group on Artificial Intelligence
SIGMOD: ACM Special Interest Group on Management of Data
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 56,   Citation Count: 45
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/304182.304200
What is a DOI?

ABSTRACT

The database query optimizer requires the estimation of the query selectivity to find the most efficient access plan. For queries referencing multiple attributes from the same relation, we need a multi-dimensional selectivity estimation technique when the attributes are dependent each other because the selectivity is determined by the joint data distribution of the attributes. Additionally, for multimedia databases, there are intrinsic requirements for the multi-dimensional selectivity estimation because feature vectors are stored in multi-dimensional indexing trees. In the 1-dimensional case, a histogram is practically the most preferable. In the multi-dimensional case, however, a histogram is not adequate because of high storage overhead and high error rates. In this paper, we propose a novel approach for the multi-dimensional selectivity estimation. Compressed information from a large number of small-sized histogram buckets is maintained using the discrete cosine transform. This enables low error rates and low storage overheads even in high dimensions. In addition, this approach has the advantage of supporting dynamic data updates by eliminating the overhead for periodical reconstructions of the compressed information. Extensive experimental results show advantages of the proposed approach.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
AFS93
BBK98
 
BKK96
 
BF95
CR94
CSZS97
CG96
 
Chri83
S. Christodoulakis. Estimating record selectivities. Information Systems Journal, 8(2), pp 105-115, 1983.
 
EKSWX98
 
EKSX96
M. Ester, H. Kriegel, J. Sander, X. Xu. A Density Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In Proc. 2nd Int. Conf. on Knowledge Discovering and Data Mining, 1996.
Fa96
Fa98
GRS98
 
HNSS95
 
Io93
IP95
 
JKMPSS98
 
Lim90
MCS98
 
NH94
PSTW93
PIHS96
 
PI97
 
RY90
SCZ98
SLRD93
 
WKW94
ZRL96

CITED BY  45

Collaborative Colleagues:
Ju-Hong Lee: colleagues
Deok-Hwan Kim: colleagues
Chin-Wan Chung: colleagues