|
ABSTRACT
Given two spatial datasets P (e.g., facilities) and Q (queries), an aggregate nearest neighbor (ANN) query retrieves the point(s) of P with the smallest aggregate distance(s) to points in Q. Assuming, for example, n users at locations q1,…qn, an ANN query outputs the facility p ∈ P that minimizes the sum of distances |pqi| for 1 ≤ i ≤ n that the users have to travel in order to meet there. Similarly, another ANN query may report the point p ∈ P that minimizes the maximum distance that any user has to travel, or the minimum distance from some user to his/her closest facility. If Q fits in memory and P is indexed by an R-tree, we develop algorithms for aggregate nearest neighbors that capture several versions of the problem, including weighted queries and incremental reporting of results. Then, we analyze their performance and propose cost models for query optimization. Finally, we extend our techniques for disk-resident queries and approximate ANN retrieval. The efficiency of the algorithms and the accuracy of the cost models are evaluated through extensive experiments with real and synthetic datasets.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Swarup Acharya , Viswanath Poosala , Sridhar Ramaswamy, Selectivity estimation in spatial databases, Proceedings of the 1999 ACM SIGMOD international conference on Management of data, p.13-24, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
|
 |
2
|
|
 |
3
|
|
 |
4
|
Norbert Beckmann , Hans-Peter Kriegel , Ralf Schneider , Bernhard Seeger, The R*-tree: an efficient and robust access method for points and rectangles, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.322-331, May 23-26, 1990, Atlantic City, New Jersey, United States
|
| |
5
|
|
 |
6
|
Stefan Berchtold , Christian Böhm , Daniel A. Keim , Hans-Peter Kriegel, A cost model for nearest neighbor search in high-dimensional data space, Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.78-86, May 11-15, 1997, Tucson, Arizona, United States
[doi> 10.1145/263661.263671]
|
| |
7
|
|
 |
8
|
|
 |
9
|
|
 |
10
|
Yuan-Chi Chang , Lawrence Bergman , Vittorio Castelli , Chung-Sheng Li , Ming-Ling Lo , John R. Smith, The onion technique: indexing for linear optimization queries, Proceedings of the 2000 ACM SIGMOD international conference on Management of data, p.391-402, May 15-18, 2000, Dallas, Texas, United States
|
 |
11
|
|
 |
12
|
Antonio Corral , Yannis Manolopoulos , Yannis Theodoridis , Michael Vassilakopoulos, Closest pair queries in spatial databases, Proceedings of the 2000 ACM SIGMOD international conference on Management of data, p.189-200, May 15-18, 2000, Dallas, Texas, United States
|
 |
13
|
|
 |
14
|
|
 |
15
|
|
 |
16
|
|
| |
17
|
Henrich, A. 1994. A distance scan algorithm for spatial access structures. In Proceedings of ACM Workshop on Geographic Information Systems (ACM GIS) (Gaithersburg, Md., Dec.). ACM, New York, 136--143.
|
 |
18
|
|
 |
19
|
|
| |
20
|
|
| |
21
|
|
| |
22
|
Kolahdouzan, M. and Shahabi, C. 2004. Voronoi-based K nearest neighbor search for spatial network databases. In Proceedings of Very Large Data Bases Conference (VLDB) (Toronto, Ont., Canada, Aug.). 840--851.
|
| |
23
|
|
| |
24
|
|
| |
25
|
Liu, X. and Ferhatosmanoglu, H. 2003. Efficient k-NN search on streaming data series. In Proceedings of International Symposium on Advances in Spatial and Temporal Databases (SSTD) (Santorini Island, Greece, July). 83--101.
|
| |
26
|
|
| |
27
|
|
| |
28
|
|
| |
29
|
Papadias, D., Zhang, J., Mamoulis, N., and Tao, Y. 2003. Query processing in spatial network databases. In Proceedings of Very Large Data Bases Conference (VLDB) (Berlin, Germany, Sept.). 802--813.
|
| |
30
|
|
| |
31
|
Press, W., Flannery, B., Teukolsky, S., and Vetterling, W. 2002. Numerical recipes in C++ (second edition). Cambridge University Press, ISBN 0-521-75034-2.
|
 |
32
|
Nick Roussopoulos , Stephen Kelley , Frédéric Vincent, Nearest neighbor queries, Proceedings of the 1995 ACM SIGMOD international conference on Management of data, p.71-79, May 22-25, 1995, San Jose, California, United States
|
| |
33
|
|
| |
34
|
Shan, J., Zhang, D., and Salzberg, B. 2003. On spatial-range closest-pair query. In Proceedings of International Symposium on Advances in Spatial and Temporal Databases (SSTD) (Santorini Island, Greece, July). 252--269.
|
| |
35
|
|
| |
36
|
Sproull, R. 1991. Refinements to nearest neighbor searching in K-dimensional trees. Algorithmica 6, 4, 579--589.
|
 |
37
|
|
| |
38
|
|
| |
39
|
|
| |
40
|
|
| |
41
|
Welzl, E. 1991. Smallest enclosing disks (balls and ellipsoids). In New Results and New Trends in Computer Science. Springer-Verlag, Lecture Notes in Computer Science, vol. 555. New York, 359--370.
|
| |
42
|
Wesolowsky, G. 1993. The Weber problem: History and perspectives. Loc. Sci. 1, 1, 5--23.
|
| |
43
|
|
| |
44
|
|
| |
45
|
|
CITED BY 15
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Humberto L. Razente , Maria Camila N. Barioni , Agma J. M. Traina , Caetano Traina, Jr., Aggregate similarity queries in relevance feedback methods for content-based image retrieval, Proceedings of the 2008 ACM symposium on Applied computing, March 16-20, 2008, Fortaleza, Ceara, Brazil
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Humberto L. Razente , Maria Camila N. Barioni , Agma Juci M. Traina , Christos Faloutsos , Caetano Traina, Jr., A novel optimization approach to efficiently process aggregate similarity queries in metric access methods, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|