ACM Home Page
Please provide us with feedback. Feedback
Secondary-storage confidence computation for conjunctive queries with inequalities
Full text PdfPdf (566 KB)
Source
International Conference on Management of Data archive
Proceedings of the 35th SIGMOD international conference on Management of data table of contents
Providence, Rhode Island, USA
SESSION: Research session 10: probabilistic databases I table of contents
Pages 389-402  
Year of Publication: 2009
ISBN:978-1-60558-551-2
Authors
Dan Olteanu  Oxford University, Oxford, United Kingdom
Jiewen Huang  Oxford University, Oxford, United Kingdom
Sponsors
ACM: Association for Computing Machinery
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 29,   Downloads (12 Months): 130,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1559845.1559887
What is a DOI?

ABSTRACT

This paper investigates the problem of efficiently computing the confidences of distinct tuples in the answers to conjunctive queries with inequalities (<) on tuple-independent probabilistic databases. This problem is fundamental to probabilistic databases and was recently stated open.

Our contributions are of both theoretical and practical importance. We define a class of tractable queries with inequalities, and generalize existing results on #P-hardness of query evaluation, now in the presence of inequalities.

For the tractable queries, we introduce a confidence computation technique based on efficient compilation of the lineage of the query answer into Ordered Binary Decision Diagrams (OBDDs), whose sizes are linear in the number of variables of the lineage.

We implemented a secondary-storage variant of our technique in PostgreSQL. This variant does not need to materialize the OBDD, but computes, in one scan over the lineage, the probabilities of OBDD fragments and combines them on the fly. Experiments with probabilistic TPC-H data show up to two orders of magnitude improvements when compared with state-of-the-art approaches.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
E. Adar and C. Re. "Managing Uncertainty in Social Networks". IEEE Data Eng. Bull. 30(2), 2007.
 
2
L. Antova, C. Koch, and D. Olteanu. "10<sup>10</sup>6 Worlds and Beyond: Efficient Representation and Processing of Incomplete Information". In Proc. ICDE 2007.
 
3
 
4
 
5
6
7
 
8
A. Darwiche and P. Marquis. "A knowlege compilation map". Journal of AI Research 17: 229--264, 2002.
9
10
 
11
 
12
 
13
 
14
 
15
 
16
C. Re, N. Dalvi, and D. Suciu. Efficient top-k query evaluation on probabilistic data. In Proc. ICDE 2007.

Collaborative Colleagues:
Dan Olteanu: colleagues
Jiewen Huang: colleagues