ACM Home Page
Please provide us with feedback. Feedback
A probabilistic relational algebra for the integration of information retrieval and database systems
Full text PdfPdf (2.10 MB)
Source ACM Transactions on Information Systems (TOIS) archive
Volume 15 ,  Issue 1  (January 1997) table of contents
Pages: 32 - 66  
Year of Publication: 1997
ISSN:1046-8188
Authors
Norbert Fuhr  Univ. of Dortmund, Dortmund, Germany
Thomas Rölleke  Univ. of Dortmund, Dortmund, Germany
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 20,   Downloads (12 Months): 144,   Citation Count: 56
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/239041.239045
What is a DOI?

ABSTRACT

We present a probabilistic relational algebra (PRA) which is a generalization of standard relational algebra. In PRA, tuples are assigned probabilistic weights giving the probability that a tuple belongs to a relation. Based on intensional semantics, the tuple weights of the result of a PRA expression always conform to the underlying probabilistic model. We also show for which expressions extensional semantics yields the same results. Furthermore, we discuss complexity issues and indicate possibilities for optimization. With regard to databases, the approach allows for representing imprecise attribute values, whereas for information retrieval, probabilistic document indexing and probabilistic search term weighting can be modeled. We introduce the concept of vague predicates which yield probabilistic weights instead of Boolean values, thus allowing for queries with vague selection conditions. With these features, PRA implements uncertainty and vagueness in combination with the relational model.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
BILLINGSLEY, P. 1979. Probability and Measure. John Wiley and Sons, New York.
 
3
 
4
 
5
6
7
 
8
 
9
 
10
11
 
12
13
14
15
 
16
FUHR, N. AND ROLLEKE, T. 1996. A probabilistic NF2 relational algebra for integrated information retrieval and database systems. In Proceedings of the 2nd World Conference on Integrated Design and Process Technology (IDPT). Soc. for Design and Process Science, Austin, Tex.
17
 
18
GU, J., THIEL, U., AND ZHAO, J. 1993. Efficient retrieval of complex objects: Query processing in a hybrid DB and IR system. In Proceedings 1. GI-Fachtagung Information Retrieval. Universit#tsverlag Konstanz, Konstanz, Germany.
19
 
20
 
21
IMIELINSKI, T. 1989. Incomplete information in logical databases. Data Eng. Bull. 12, 2, 29-40.
22
23
 
24
KIM, W., Ed. 1989. Special issue on imprecision in databases. Data Eng. Bull. 12, 2.
 
25
 
26
27
28
 
29
MACLEOD, I. 1991. Text retrieval and the relational model. J. Am. Soc. Inf. Sci. 42, 3, 155-165.
30
31
 
32
33
 
34
 
35
PRADE, H. AND TESTEMALE, C. 1984. Generalizing database relational algebra for the treatment of incomplete/uncertain information and vague queries. Inf. Sci. 34, 115-143.
 
36
 
37
RAGHAVAN, V., SAXTON, L., WONG, S., AND TING, S. 1986. A unified architecture for the integration of data base management and information retrieval systems. In Information Processing 86, H.-J. Kugler, Ed. Elsevier, Amsterdam, 1049-1054.
 
38
REITER, R. 1984. Towards a logical reconstruction of relational database theory. In On Conceptual Modelling, M. Brodie, J. Mylopoulous, and J. Schmidt, Eds. Springer, New York.
 
39
ROLLEKE, T. 1994. Equivalences of the probabilistic relational algebra. Tech. Rep., Dept. of Computer Science, Univ. of Dortmund, Dortmund, Germany.
 
40
 
41
 
42
 
43
 
44
VAN RIJSBERGEN, C.J. 1986. A non-classical logic for information retrieval. Comput. J. 29, 6, 481-485.
45
 
46
47
 
48

CITED BY  56


REVIEW

"Fazli Can : Reviewer"

The information retrieval (IR) literature contains various studies on the integration of IR and database management systems. This paper presents a probabilistic relational algebra (PRA) for this purpose. PRA is a logical data model based on in  more...

Collaborative Colleagues:
Norbert Fuhr: colleagues
Thomas Rölleke: colleagues