ACM Home Page
Please provide us with feedback. Feedback
Conditioning probabilistic databases
Full text PdfPdf (650 KB)
Source
Proceedings of the VLDB Endowment archive
Volume 1 ,  Issue 1  (August 2008) table of contents
SESSION: Query processing in uncertain databases table of contents
Pages 313-325  
Year of Publication: 2008
ISSN:2150-8097
Authors
Christoph Koch  Cornell University, Ithaca, NY
Dan Olteanu  Oxford University, Oxford, UK
Publisher
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 69,   Citation Count: 8
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1453856.1453894
What is a DOI?

ABSTRACT

Past research on probabilistic databases has studied the problem of answering queries on a static database. Application scenarios of probabilistic databases however often involve the conditioning of a database using additional information in the form of new evidence. The conditioning problem is thus to transform a probabilistic database of priors into a posterior probabilistic database which is materialized for subsequent query processing or further refinement. It turns out that the conditioning problem is closely related to the problem of computing exact tuple confidence values.

It is known that exact confidence computation is an NP-hard problem. This has led researchers to consider approximation techniques for confidence computation. However, neither conditioning nor exact confidence computation can be solved using such techniques. In this paper we present efficient techniques for both problems. We study several problem decomposition methods and heuristics that are based on the most successful search techniques from constraint satisfaction, such as the Davis-Putnam algorithm. We complement this with a thorough experimental evaluation of the algorithms proposed. Our experiments show that our exact algorithms scale well to realistic database sizes and can in some scenarios compete with the most efficient previous approximation algorithms.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
L. Antova, T. Jansen, C. Koch, and D. Olteanu. "Fast and Simple Relational Processing of Uncertain Data". In Proc. ICDE, 2008.
 
4
L. Antova, C. Koch, and D. Olteanu. "10106 Worlds and Beyond: Efficient Representation and Processing of Incomplete Information". In Proc. ICDE, 2007.
 
5
 
6
E. Birnbaum and E. Lozinskii. "The Good Old Davis-Putnam Procedure Helps Counting Models". Journal of AI Research, 10(6):457--477, 1999.
 
7
 
8
 
9
10
 
11
A. Darwiche and P. Marquis. "A knowlege compilation map". Journal of AI Research, 17:229--264, 2002.
12
13
14
 
15
J. Huang and D. Olteanu. Conjunctive queries with inequalities on probabilistic databases. Technical report, University of Oxford, 2008.
16
 
17
 
18
19
 
20
 
21
C. Re, N. Dalvi, and D. Suciu. Efficient top-k query evaluation on probabilistic data. In Proc. ICDE, pages 886--895, 2007.
 
22
A. D. Sarma, M. Theobald, and J. Widom. "Exploiting Lineage for Confidence Computation in Uncertain and Probabilistic Databases". In Proc. ICDE, 2008.
 
23
P. Sen and A. Deshpande. "Representing and Querying Correlated Tuples in Probabilistic Databases". In Proc. ICDE, pages 596--605, 2007.
 
24
 
25

CITED BY  8

Collaborative Colleagues:
Christoph Koch: colleagues
Dan Olteanu: colleagues