|
ABSTRACT
Uncertainty pervades many domains in our lives. Current real-life applications, e.g., location tracking using GPS devices or cell phones, multimedia feature extraction, and sensor data management, deal with different kinds of uncertainty. Finding the nearest neighbor objects to a given query point is an important query type in these applications. In this paper, we study the problem of finding objects with the highest marginal probability of being the nearest neighbors to a query object. We adopt a general uncertainty model allowing for data and query uncertainty. Under this model, we define new query semantics, and provide several efficient evaluation algorithms. We analyze the cost factors involved in query evaluation, and present novel techniques to address the trade-offs among these factors. We give multiple extensions to our techniques including handling dependencies among data objects, and answering threshold queries. We conduct an extensive experimental study to evaluate our techniques on both real and synthetic data.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Topologically integrated geographic encoding and referencing (tiger) system, http://www.census.gov/geo/www/tiger/.
|
 |
2
|
|
| |
3
|
|
 |
4
|
|
| |
5
|
J. Chen and R. Cheng. Efficient evaluation of imprecise location-dependent queries. In ICDE, 2007.
|
| |
6
|
R. Cheng, J. Chen, M. Mokbel, and C. Chow. Probabilistic verifiers: Evaluating constrained nearest-neighbor queries over uncertain data. In ICDE, 2008.
|
| |
7
|
|
| |
8
|
X. Dai, M. L. Yiu, N. Mamoulis, Y. Tao, and M. Vaitis. Probabilistic spatial queries on existentially uncertain data. In SSTD, 2005.
|
 |
9
|
|
| |
10
|
|
| |
11
|
H. Franco-Lopez, A. R. Ek, and M. E. Bauer. Estimation and mapping of forest stand density, volume, and cover type using the k-nearest neighbors method. Remote Sensing of Environment 2001.
|
| |
12
|
J. Geweke. Efficient simulation from the multivariate normal and student---t distributions subject to linear constraints and the evaluation of constraint probabilities. In Computing Science and Statistics: Proceedings of the 23rd Symposium on the Interface, 1991.
|
| |
13
|
M. Hadjieleftheriou. Spatial index library, http://research.att.com/~marioh/spatialindex/.
|
| |
14
|
G. R. Hjaltason and H. Samet. Ranking in spatial databases. In SSD, 1995.
|
 |
15
|
Dmitri V. Kalashnikov , Yiming Ma , Sharad Mehrotra , Ramaswamy Hariharan, Index for fast retrieval of uncertain spatial point data, Proceedings of the 14th annual ACM international symposium on Advances in geographic information systems, November 10-11, 2006, Arlington, Virginia, USA
[doi> 10.1145/1183471.1183504]
|
| |
16
|
H.-P. Kriegel, P. Kunath, and M. Renz. Probabilistic nearest-neighbor query on uncertain objects. In DASFAA, 2007.
|
 |
17
|
|
| |
18
|
E. Li, D. Boos, and M. Gumpertz. Simulation study in statistics. Journal of Interconnection Networks, 2001.
|
| |
19
|
|
| |
20
|
V. Ljosa and A. K. Singh. Apla: Indexing arbitrary probability distributions. In ICDE, 2007.
|
 |
21
|
|
 |
22
|
|
| |
23
|
K. Munagala, S. Babu, R. Motwani, and J. Widomy. The pipelined set cover problem. In ICDT, 2005.
|
| |
24
|
C. Re, N. Dalvi, and D. Suciu. Efficient top-k query evaluation on probabilistic data. In ICDE, 2007.
|
 |
25
|
|
| |
26
|
|
| |
27
|
M. A. Soliman, I. F. Ilyas, and K. C. Chang. Top-k query processing in uncertain databases. In ICDE, 2007.
|
| |
28
|
Yufei Tao , Reynold Cheng , Xiaokui Xiao , Wang Kay Ngai , Ben Kao , Sunil Prabhakar, Indexing multi-dimensional uncertain data with arbitrary probability density functions, Proceedings of the 31st international conference on Very large data bases, August 30-September 02, 2005, Trondheim, Norway
|
| |
29
|
|
| |
30
|
|
| |
31
|
R. Yamamoto, H. Matsutani, H. Matsuki, T. Oono, and H. Ohtsuka. Position location technologies using signal strengths in cellular system. In VTC-Spring, 2001.
|
CITED BY 4
|
|
|
|
|
|
|
|
Jun-Seok Heo , Kyu-Young Whang , Min-Soo Kim , Yi-Reun Kim , Il-Yeol Song, The partitioned-layer index: Answering monotone top-k queries using the convex skyline and partitioning-merging technique, Information Sciences: an International Journal, v.179 n.19, p.3286-3308, September, 2009
|
|
|
|
|