ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Estimating retrieval effectiveness using rank distributions
Full text PdfPdf (630 KB)
Source
Conference on Information and Knowledge Management archive
Proceeding of the 17th ACM conference on Information and knowledge management table of contents
Napa Valley, California, USA
POSTER SESSION: Poster session 2/information retrieval table of contents
Pages: 1425-1426  
Year of Publication: 2008
ISBN:978-1-59593-991-3
Authors
Vishwa Vinay  Microsoft Research, Cambridge, United Kingdom
Natasa Milic-Frayling  Microsoft Research, Cambridge, United Kingdom
Ingemar Cox  University College London, London, United Kingdom
Sponsors
ACM: Association for Computing Machinery
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 81,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1458082.1458314
What is a DOI?

ABSTRACT

In this paper, we consider the task of estimating query effectiveness, i.e., assessment of the retrieval system performance in absence of the user relevance judgments. In our approach we model the score associated with each document in the result set as a Gaussian random variable. The mean and the variance of each document score can then be used to estimate the probability that a document will be ranked above another one and thus calculate the expected rank of the document in the ranked list. We propose to measure the effectiveness of the system performance by comparing the predicted and actual ranks of the retrieved documents. In our experiments we consider two retrieval models and five document scoring methods and evaluate their impact on the proposed estimation measures. Our experiments with standardized data sets that include document relevance judgments and the task of predicting the relative query effectiveness show that the expected rank metric is robust to variations in document scoring and retrieval algorithms.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
4
5
 
6
Zhu, X., Lafferty, J. and Ghahramani, Z. Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions. Proc. of the ICML 2003 Workshop on The Continuum from Labeled to Unlabeled Data in ML and Data Mining
 
7
The Lemur Toolkit for Language Modeling and Information Retrieval, http://www.lemurproject.org

Collaborative Colleagues:
Vishwa Vinay: colleagues
Natasa Milic-Frayling: colleagues
Ingemar Cox: colleagues