ACM Home Page
Please provide us with feedback. Feedback
“Is this document relevant?…probably”: a survey of probabilistic models in information retrieval
Full text PdfPdf (266 KB)
Source ACM Computing Surveys (CSUR) archive
Volume 30 ,  Issue 4  (December 1998) table of contents
Pages: 528 - 552  
Year of Publication: 1998
ISSN:0360-0300
Authors
Fabio Crestani  Univ. of Glasgow, Glasgow, Scotland, UK
Mounia Lalmas  Univ. of Glasgow, Glasgow, Scotland, UK
Cornelis J. Van Rijsbergen  Univ. of Glasgow, Glasgow, Scotland, UK
Iain Campbell  Univ. of Glasgow, Glasgow, Scotland, UK
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 29,   Downloads (12 Months): 253,   Citation Count: 33
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/299917.299920
What is a DOI?

ABSTRACT

This article surveys probablistic approaches to modeling information retrieval. The basic concepts of probabilistic approaches to information retrieval are outlined and the principles and assumptions upon which the approaches are based are presented. The various models proposed in the development of IR are described, classified, and compared using a common formalism. New approaches that constitute the basis of future research are described.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
AMATI, G. AND KERPEDJIEV, S. 1992. An information retrieval logical model: Implementation and experiments. Tech. Rep. Rel 5B04892 (March), Fondazione Ugo Bordoni, Roma, Italy.
 
2
AMATI, G. AND VAN RIJSBERGEN, C. J. 1995. Probability, information and information retrieval. In Proceedings of the First International Workshop on Information Retrieval, Uncertainty and Logic (Glasgow, Sept.).
3
 
4
BOOKSTEIN, A. AND COOPER, W.S. 1976. A general mathematical model for information retrieval systems. Libr. Quart. 46, 2.
 
5
BOOKSTEIN, A. AND SWANSON, D. 1974. Probabilistic models for automatic indexing. J. Am. Soc. Inf. Sci. 25, 5, 312-318.
 
6
BORGOGNA, G. AND PASI, G. 1993. A fuzzy linguistic approach generalizing Boolean information retrieval: A model and its evaluation. J. Am. Soc. Inf. Sci. 2, 70-82, 44.
 
7
BRUZA, P. D. 1993. Stratified information disclosure: A synthesis between hypermedia and information retrieval. Ph.D. Thesis, Katholieke Universiteit Nijmegen, The Netherlands.
 
8
 
9
CAMPBELL, I. AND VAN RIJSBERGEN, C. J. 1996. The ostensive model of developing information needs. In Proceedings of CoLIS 2 (Copenhagen, Oct.), 251-268.
 
10
 
11
COOPER, W. S. 1971. A definition of relevance for information retrieval. Inf. Storage Retrieval 7, 19-37.
12
13
 
14
Cox, D. R. 1970. Analysis of Binary Data. Methuen, London.
 
15
CRESTANI, F. AND VAN RIJSBERGEN, C.J. 1995a. Information retrieval by logical imaging. J. Doc. 51, 1, 1-15.
16
 
17
 
18
CROFT, W. B. AND HARPER, D. J. 1979. Using probabilistic models of document retrieval without relevance information. J. Doc. 35, 285-295.
 
19
20
 
21
22
 
23
DE SILVA, W. T. AND MILIDIU, R.L. 1993. Belief function model for information retrieval. J. Am. Soc. Inf. Sci. 4, 1, 10-18.
 
24
DEMPSTER, A. P. 1968. A generalization of the Bayesian inference. J. Royal Stat. Soc. 30, 205-447.
 
25
DUNLOP, M. D. 1991. Multimedia information retrieval. Ph.D. Thesis, Department of Computing Science, University of Glasgow, Glasgow.
 
26
 
27
28
 
29
30
31
 
32
FUHR, N. AND BUCKLEY, C. 1993. Optimizing document indexing and search term weighting based on probabilistic models. In The First Text Retrieval Conference (TREC-1), D. Harman, Ed., Special Publication 500-207. National Institute of Standards and Technology, Gaithersburg, MD, 89-100.
 
33
34
35
 
36
GOOD, I.J. 1950. Probability and the Weighing of Evidence. Charles Griffin Symand.
 
37
38
39
 
40
HARMAN, D. 1996. Overview of the fifth text retrieval conference (TREC-5). In Proceeding of the TREC Conference (Gaithersburg, MD, Nov.).
 
41
 
42
HARTER, S.P. 1975. A probabilistic approach to automatic keyword indexing: Part 1. J. Am. Soc. Inf. Sci. 26, 4, 197-206.
 
43
HUIBERS, T. W. C. 1996. An axiomatic theory for information retrieval. Ph.D. Thesis, Utrecht University, The Netherlands.
 
44
JEFFREY, R. C. 1965. The Logic of Decision. McGraw-Hill, New York.
45
 
46
LALMAS, M. 1992. A logic model of information retrieval based on situation theory. In Proceedings of the Fourteenth BCS Information Retrieval Colloquium (Lancaster, UK, Dec.).
 
47
48
 
49
50
 
51
MILLER, W. L. 1971. A probabilistic search strategy for MEDLARS. J. Doc. 27, 254-266.
 
52
MIZZARO, S. 1996. Relevance: The whole (hi)story. Tech. Rep. UDMI/12/96/RR (Dec.), Dipartimento di Matematica e Informatica, Universita' di Udine, Italy.
 
53
54
 
55
NIL, J.Y. 1989. An information retrieval model based on modal logic. Inf. Process. Manage. 25, 5, 477-491.
56
 
57
NIL, J. Y., LEPAGE, F., AND BRISEBOIS, M. 1996. Information retrieval as counterfactual. Comput. J. 38, 8, 643-657.
 
58
 
59
PEARL, g. 1990. Jeffrey's rule, passage of experience and Neo-Bayesianism. In Knowledge Representation and Defeasible Reasoning, H. E. Kyburg, R. P. Luoi, and G. N. Carlson, Eds., Kluwer Academic, Dordrecht, The Netherlands, 245-265.
 
60
ROBERTSON, S.E. 1977. The probability ranking principle in IR. J. Doc. 33, 4 (Dec.), 294-304.
 
61
ROBERTSON, S. E. AND SPARCK JONES, K. 1976. Relevance weighting of search terms. J. Am. Soc. Inf. Sci. 27, 129-146.
 
62
 
63
ROBERTSON, S. E., MARON, M. E., AND COOPER, W.S. 1982. Probability of relevance: A unification of two competing models for document retrieval. Inf. Technol. Res. Dev. 1, 1-21.
 
64
 
65
 
66
 
67
 
68
SEMBOK, T. M. T. AND VAN RIJSBERGEN, C. J. 1993. Imaging: A relevance feedback retrieval with nearest neighbour clusters. In Proceedings of the BCS Colloquium in Information Retrieval (Glasgow, March), 91-107.
 
69
SERACEVIC, T. 1970. The concept of "relevance" in information science: A historical review. In Introduction to Information Science, T. Seracevic, Ed., R. R. Bower, New York, Chapter 14.
 
70
SHAPER, G. 1976. A Mathematical Theory of Evidence. Princeton University Press, Princeton, NJ.
 
71
SMITH, S. AND STANFILL, C. 1988. An analysis of the effects of data corruption on text retrieval performance. Tech. Rep. (Dec.), Thinking Machines Corporation, Cambridge, MA.
 
72
 
73
STALNAKER, R. 1981. Probability and conditionals. In Ifs, W. L. Harper, R. Stalnaker, and G. Pearce, Eds., The University of Western Ontario Series in Philosophy of Science, D. Riedel, Dordrecht, Holland, 107-128.
 
74
 
75
 
76
 
77
78
79
 
80
 
81
TURTLE, H. R. AND CROFT, W.B. 1992b. Uncertainty in information retrieval systems. Unpublished paper.
 
82
VAN RIJSBERGEN, C.J. 1977. A theoretical basis for the use of co-occurrence data in information retrieval. J. Doc. 33, 2 (June), 106-119.
 
83
 
84
VAN RIJSBERGEN, C. J. 1986. A non-classical logic for information retrieval. Comput. J. 29, 6, 481-485.
85
 
86
VAN RIJSBERGEN, C. J. 1992. Probabilistic retrieval revisited. Departmental Research Report 1992/R2 (Jan.), Computing Science Department, University of Glasgow, Glasgow.
 
87
 
88
89
 
90

CITED BY  33


REVIEW

"Karen Sparck-Jones : Reviewer"

This useful review provides a competent, clear, and accessible account of retrieval models that take probability as their grounding notion in defining the relevance relation between queries and documents. These models ar  more...

Collaborative Colleagues:
Fabio Crestani: colleagues
Mounia Lalmas: colleagues
Cornelis J. Van Rijsbergen: colleagues
Iain Campbell: colleagues