ACM Home Page
Please provide us with feedback. Feedback
iDistance: An adaptive B+-tree based indexing method for nearest neighbor search
Full text PdfPdf (1.16 MB)
Source ACM Transactions on Database Systems (TODS) archive
Volume 30 ,  Issue 2  (June 2005) table of contents
Pages: 364 - 397  
Year of Publication: 2005
ISSN:0362-5915
Authors
H. V. Jagadish  University of Michigan, Ann Arbor, MI
Beng Chin Ooi  National University of Singapore, Singapore
Kian-Lee Tan  National University of Singapore, Singapore
Cui Yu  Monmouth University, West Long Branch, NJ
Rui Zhang  National University of Singapore, Singapore
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 27,   Downloads (12 Months): 183,   Citation Count: 30
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1071610.1071612
What is a DOI?

ABSTRACT

In this article, we present an efficient B+-tree based indexing method, called iDistance, for K-nearest neighbor (KNN) search in a high-dimensional metric space. iDistance partitions the data based on a space- or data-partitioning strategy, and selects a reference point for each partition. The data points in each partition are transformed into a single dimensional value based on their similarity with respect to the reference point. This allows the points to be indexed using a B+-tree structure and KNN search to be performed using one-dimensional range search. The choice of partition and reference points adapts the index structure to the data distribution.We conducted extensive experiments to evaluate the iDistance technique, and report results demonstrating its effectiveness. We also present a cost model for iDistance KNN search, which can be exploited in query optimization.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
3
 
4
5
 
6
 
7
 
8
9
10
 
11
 
12
 
13
14
 
15
16
 
17
 
18
19
20
 
21
Jagadish, H., Ooi, B. C., Tan, K.-L., Yu, C., and Zhang, R. 2004. iDistance: An adaptive B+-tree based indexing method for nearest neighbor search. Tech. Rep. www.comp.nus.edu.sg/~ooibc, National University of Singapore.
 
22
Jolliffe, I. T. 1986. Principle Component Analysis. Springer-Verlag.
23
 
24
Koudas, N., Ooi, B. C., Tan, K.-L., and Zhang, R. 2004. Approximate NN queries on streams with guaranteed error/performance bounds. In Proceedings of the International Conference on Very Large Data Bases. 804--815.
 
25
Kruskal, J. B. 1956. On the shortest spanning subtree of a graph and the travelling salesman problem. In Proceedings of the American Mathematical Society 7, 48--50.
 
26
 
27
MacQueen, J. 1967. Some methods for classification and analysis of multivariate observations. In Fifth Berkeley Symposium on Mathematical statistics and probability. University of California Press, 281--297.
28
 
29
 
30
 
31
32
 
33
 
34
 
35
 
36
37

CITED BY  30

Collaborative Colleagues:
H. V. Jagadish: colleagues
Beng Chin Ooi: colleagues
Kian-Lee Tan: colleagues
Cui Yu: colleagues
Rui Zhang: colleagues