|
ABSTRACT
In this paper, a probabilistic relational model is presented which combines relational algebra with probabilistic retrieval. Based on certain independence assumptions, the operators of the relational algebra are redefined such that the probabilistic algebra is a generalization of the standard relational algebra. Furthermore, a special join operator implementing probabilistic retrieval is proposed. When applied to typical document databases, queries can not only ask for documents, but for any kind of object in the database. In addition, an implicit ranking of these objects is provided in case the query relates to probabilistic indexing or uses the probabilistic join operator. The proposed algebra is intended as a standard interface to combined database and IR systems, as a basis for implementing user-friendly interfaces.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
Bookstein, A. (1983). Outline of a General Probabilistic Retrieval Model. Journal of Documental~,on 39(2), pages 63-72.
|
| |
4
|
Buckles, B.; Perry, F. (1982). A Fuzzy Represent, ation of Data for Relational Databases. Fuzzy Sets and Systems 7, pages 213-226.
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
Fuhr, N.; Buckley, C. (1993). Optimizing Document Indexing and Search Term Weighting Based on Probabllistic Models. In: Harman, D. (ed.): The First Tezt REtrieval Conference (TREC1). National Institute of Standards and Technology Special Publication 500-207, Gaithersburg, Md. 20899.
|
| |
9
|
|
| |
10
|
|
 |
11
|
|
| |
12
|
|
| |
13
|
Macleod, I. (1991). Text Retrieval and the Relational Model. Journal of the Amer, can Society for Informat~.on Science 42(3), pages 155-165.
|
| |
14
|
|
| |
15
|
Pfeifer, U.; Fuhr, N. (1993). Aufwandsabschiitzung fiir die Prozessierung vager Anfragen auf der Basis des Datenstrom-Ansatzes. In: Stucky, W. (ed.): Datenbank~ysterne ~,n BTi;r'o, Technik und Wissenschaft, pages 375-392. Springer, Berlin et al.
|
| |
16
|
Prade, H.; Testemale, C. (1984). Generalizing Database Relational Algebra for the Treatment of Incomplete/Uncertain information and Vague Queries. InformatzoTL Sczence 34, pages 115-143.
|
| |
17
|
|
CITED BY 12
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Klemens Böhm , Adrian Múller , Erich Neuhold, Structured document handling—a case for integrating databases and information retrieval, Proceedings of the third international conference on Information and knowledge management, p.147-154, November 29-December 02, 1994, Gaithersburg, Maryland, United States
|
|
|
|
|
|
|
|
|
|
|
|
Weifeng Su , Jiying Wang , Qiong Huang , Fred Lochovsky, Query result ranking over e-commerce web databases, Proceedings of the 15th ACM international conference on Information and knowledge management, November 06-11, 2006, Arlington, Virginia, USA
|
|
|
|
|
|
Surajit Chaudhuri , Gautam Das , Vagelis Hristidis , Gerhard Weikum, Probabilistic ranking of database query results, Proceedings of the Thirtieth international conference on Very large data bases, p.888-899, August 31-September 03, 2004, Toronto, Canada
|
|
|
|
|