ACM Home Page
Please provide us with feedback. Feedback
Indexing uncertain data
Full text PdfPdf (586 KB)
Source
Symposium on Principles of Database Systems archive
Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems table of contents
Providence, Rhode Island, USA
SESSION: Data analysis and optimization table of contents
Pages 137-146  
Year of Publication: 2009
ISBN:978-1-60558-553-6
Authors
Pankaj K. Agarwal  Duke University, Durham, NC, USA
Siu-Wing Cheng  HKUST, Hong Kong, Hong Kong
Yufei Tao  CUHK, Hong Kong, Hong Kong
Ke Yi  HKUST, Hong Kong, Hong Kong
Sponsors
ACM: Association for Computing Machinery
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGMOD: ACM Special Interest Group on Management of Data
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1559795.1559816
What is a DOI?

ABSTRACT

Querying uncertain data has emerged as an important problem in data management due to the imprecise nature of many measurement data. In this paper we study answering range queries over uncertain data. Specifically, we are given a collection P of n points in R, each represented by its one-dimensional probability density function (pdf). The goal is to build an index on P such that given a query interval I and a probability threshold τ, we can quickly report all points of P that lie in I with probability at least τ. We present various indexing schemes with linear or near-linear space and logarithmic query time. Our schemes support pdf's that are either histograms or more complex ones such as Gaussian or piecewise algebraic. They also extend to the external memory model in which the goal is to minimize the number of disk accesses when querying the index.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
P.K. Agarwal and J. Erickson, Geometric range searching and its relatives, in: Advances in Discrete and Computational Geometry (B. Chazelle, J.E. Goodman, and R. Pollack, eds.), American Mathematical Society, Providence, RI, 1999, pp. 1--56.
 
4
P.K. Agarwal and J. Matoušek, Dynamic half-space range reporting and its applications, Algorithmica, 13 (1995), 325--345.
 
5
P.K. Agarwal and M. Sharir, Davenport-Schinzel sequences and their geometric applications, in: Handbook of Computational Geometry (J.-R. Sack and J. Urrutia, eds.), Elsevier Science Publishers, Amsterdam, 2000, pp. 1--47.
6
 
7
8
 
9
J.L. Bentley and J.B. Saxe, Decomposable searching problems I: Static-to-dynamic transformation, Journal of Algorithms, 1 (1980), 301--358.
 
10
 
11
B. Chazelle and L.J. Guibas, Fractional cascading: I. A data structuring technique, Algorithmica, 1 (1986), 133--162.
 
12
 
13
 
14
 
15
 
16
 
17
H. Davenport and A. Schinzel, A combinatorial problem connected with differential equations, American Journal of Mathematics, 87 (1965), 684--689.
 
18
 
19
V. Ljosa and A.K. Singh, APLA: Indexing arbitrary probability distributions, Proc. IEEE International Conference on Data Engineering, 2007, pp. 946--955.
 
20
 
21
 
22
S. Singh, C. Mayfield, S. Prabhakar, R. Shah, and S. Hambrusch, Indexing uncertain categorical data, Proc. IEEE International Conference on Data Engineering, 2007, pp. 616--625.
 
23
24
 
25

Collaborative Colleagues:
Pankaj K. Agarwal: colleagues
Siu-Wing Cheng: colleagues
Yufei Tao: colleagues
Ke Yi: colleagues