ACM Home Page
Please provide us with feedback. Feedback
Optimum polynomial retrieval functions based on the probability ranking principle
Full text PdfPdf (1.56 MB)
Source ACM Transactions on Information Systems (TOIS) archive
Volume 7 ,  Issue 3  (July 1989) table of contents
Pages: 183 - 204  
Year of Publication: 1989
ISSN:1046-8188
Author
Norbert Fuhr  Technische Hochschule Darmstadt, Darmstadt, W. Germany
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 47,   Citation Count: 29
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/65943.65944
What is a DOI?

ABSTRACT

We show that any approach to developing optimum retrieval functions is based on two kinds of assumptions: first, a certain form of representation for documents and requests, and second, additional simplifying assumptions that predefine the type of the retrieval function. Then we describe an approach for the development of optimum polynomial retrieval functions: request-document pairs (fl, dm) are mapped onto description vectors x(fl, dm), and a polynomial function e(x) is developed such that it yields estimates of the probability of relevance P(R | x (fl, dm) with minimum square errors. We give experimental results for the application of this approach to documents with weighted indexing as well as to documents with complex representations. In contrast to other probabilistic models, our approach yields estimates of the actual probabilities, it can handle very complex representations of documents and requests, and it can be easily applied to multivalued relevance scales. On the other hand, this approach is not suited to log-linear probabilistic models and it needs large samples of relevance feedback data for its application.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
BOLLMANN, P., JOCHUM, R., REINER, U., WEISSMANN, V., AND ZUSE, H. Planung und Durchfiihrung der Retrievaltests. In Leistungsbewertung yon Information Retrieval Verfahren (LIVE), H.-J. Schneider et al., eds. TU Berlin, Fachbereich Informatik, Computergestfitzte Informationssysteme (CIS), Institut fiir Angewandte Informatik, 1986, pp. 183-212.
 
3
BOOKSTEIN, A. Logtinear Analysis of Library Data. Research Report, OCLC, Office of Research, 1988.
 
4
BOOKSTEIN, A. Outline of a general probabilistic retrieval model. J. Doc. 39, 2 (1983), 63-72.
 
5
 
6
DUI)A, R. O., AND HART, P.E. Pattern Classification and Scene Analysis. Wiley, New York, 1973.
 
7
 
8
FUnR, N. A probabilistic model of dictionary based automatic indexing. In Proceedings of the Riao 85 (Recherche d'informations Assistee par Ordinateur) (Grenoble, France, March 18-20). 1985, pp. 207-216.
 
9
FUHR, N. Probabilistisches lndexing und Retrieval. Fachinformationszentrum Karlsruhe, Eggenstein-Leopoldshafen, West Germany, 1988.
10
11
 
12
 
13
14
 
15
KEEN, E. M. Evaluation parameters. In The SMART Retrieval System--Experiments in Automatic Document Processing, G. Salton, ed. Prentice Hall, Englewood Cliffs, N.J., 1971, pp. 74-112.
 
16
KNORZ, G. Automatisches Indexieren als Erkennen abstrakter Objekte. Niemeyer, Tfibingen, West Germany, 1983.
 
17
 
18
KONSTANTIN, J. Untersuchung yon nach dem Quadratmittel-Polynomansatz erstellten Ranking{unktionen. Diplomarbeit, TH Darmstadt, FB Informatik, Datenverwaltungssysteme II, Darmstadt, West Germany 1985.
 
19
LUSTlCL G. Automatische Indexierung zwischen Forschung und Anwendung. Olms, Hildesheim, West Germany 1986.
 
20
 
21
ROBERTSON, S.E. The probability ranking principle in IR. J. Doc. 33 (1977), 294-304.
 
22
ROBERTSON, S. E., MARON, M. E., AND COOPER, W.S. Probability of relevance: A unification of two competing models for document retrieval. Inf. Tech. Res. 1 (1982), 1-21.
 
23
ROCCHIO, J.J. Relevance feedback in information retrieval. In The SMART Retrieval System~ Experiments in Automatic Document Processing, G. Salton, ed. Prentice Hall, Englewood Cliffs, N.J., 1971.
 
24
 
25
SCHORMANN, J. Polynomklassifikatoren fur die Zeichenerkennung. Ansatz, Adaption, Anwendung. Oldenbourg, M/inchen, West Germany, 1977.
26

CITED BY  29


REVIEW

"Kathleen H. V. Booth : Reviewer"

This excellent paper describes an application of the least squares polynomial method, previously used in automatic indexing, to the classification of request-document pairs in information retrieval. The retrieval functions developed provide both  more...