ACM Home Page
Please provide us with feedback. Feedback
Rank-biased precision for measurement of retrieval effectiveness
Full text PdfPdf (209 KB)
Source
ACM Transactions on Information Systems (TOIS) archive
Volume 27 ,  Issue 1  (December 2008) table of contents
Article No. 2  
Year of Publication: 2008
ISSN:1046-8188
Authors
Alistair Moffat  The University of Melbourne, Victoria, Australia
Justin Zobel  RMIT University and NICTA Victoria Research Laboratory, Victoria, Australia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 46,   Downloads (12 Months): 448,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1416950.1416952
What is a DOI?

ABSTRACT

A range of methods for measuring the effectiveness of information retrieval systems has been proposed. These are typically intended to provide a quantitative single-value summary of a document ranking relative to a query. However, many of these measures have failings. For example, recall is not well founded as a measure of satisfaction, since the user of an actual system cannot judge recall. Average precision is derived from recall, and suffers from the same problem. In addition, average precision lacks key stability properties that are needed for robust experiments. In this article, we introduce a new effectiveness metric, rank-biased precision, that avoids these problems. Rank-biased pre-cision is derived from a simple model of user behavior, is robust if answer rankings are extended to greater depths, and allows accurate quantification of experimental uncertainty, even when only partial relevance judgments are available.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
4
5
 
6
Buckley, C. and Voorhees, E. M. 2005. Retrieval system evaluation. In TREC: Experiment and Evaluation in Information Retrieval. MIT Press, Cambridge, MA, Chapter 3, 53--75.
7
8
 
9
Cooper, W. S. 1968. Expected search length: A single measure of retrieval effectiveness based on weak ordering action of retrieval systems. Amer. Document. 19, 1 (Jan.), 30--41.
 
10
Cooper, W. S. 1973. On selecting a measure of retrieval effectiveness: Part I, the ‘subjective’ philosophy of evaluation. J. Amer. Soc. Inform. Sci. 24, 87--100.
11
 
12
 
13
 
14
 
15
16
17
18
 
19
 
20
 
21
 
22
 
23
Marchionini, G., Moffat, A., Tate, J., Baeza-Yates, R., and Ziviani, N., Eds. 2005. Proceedings of the Twenty-Eighth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, New York, NY.
 
24
Meng, X. and Chen, Z. 2004. On user-oriented measurements of effectiveness of Web information retrieval systems. In Proceedings of the International Conference on Internet Computing, H. R. Arabnia, O. Droegehorn, and S. Chatterjee, Eds. CSREA Press, Las Vegas, NV, 527--533.
 
25
26
27
 
28
Sakai, T. 2004. Ranking the NTCIR systems based on multigrade relevance. In Proceedings of the AIRS Asian Information Retrieval Symposium. Lecture Notes in Computer Science, vol. 3411. Springer, Berlin, Germany, 251--262.
29
30
31
 
32
Shaw, Jr., W. M. 1986. On the foundation of evaluation. J. Amer. Soc. Inform. Sci. 37, 5, 346--348.
 
33
 
34
 
35
 
36
37
 
38
39


Collaborative Colleagues:
Alistair Moffat: colleagues
Justin Zobel: colleagues