ACM Home Page
Please provide us with feedback. Feedback
Retrieval evaluation with incomplete information
Full text PdfPdf (199 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Sheffield, United Kingdom
SESSION: Test collections table of contents
Pages: 25 - 32  
Year of Publication: 2004
ISBN:1-58113-881-4
Authors
Chris Buckley  Sabir Research, Inc., Gaithersburg, MD
Ellen M. Voorhees  National Institute of Standards and Technology, Gaithersburg, Maryland
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 20,   Downloads (12 Months): 169,   Citation Count: 76
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1008992.1009000
What is a DOI?

ABSTRACT

This paper examines whether the Cranfield evaluation methodology is robust to gross violations of the completeness assumption (i.e., the assumption that all relevant documents within a test collection have been identified and are present in the collection). We show that current evaluation measures are not robust to substantially incomplete relevance judgments. A new measure is introduced that is both highly correlated with existing measures when complete judgments are available and more robust to incomplete judgment sets. This finding suggests that substantially larger or dynamic test collections built using current pooling practices should be viable laboratory tools, despite the fact that the relevance information will be incomplete and imperfect.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Chris Buckley. trec_eval IR evaluation package. Available from ftp://ftp.cs.cornell.edu/pub/smart.
2
3
4
5
 
6
 
7
 
8
Google. Benefits of a Google search. http://www.google.com/technology/whyuse.html, January 2004.
 
9
Stefano Mizzaro. A new measure of retrieval effectiveness(Or: What's wrong with precision and recall). In Proceedings of the International Workshop on Information Retrieval(IR'2001), pages 43--52, 2001.
10
 
11
Mark E. Rorvig. The simple scalability of documents. Journal of the American Society for Information Science, 41(8):590--598, 1990.
12
 
13
K. Sparck Jones and C. van Rijsbergen. Report on the need for and provision of an "ideal" information retrieval test collection. British Library Research and Development Report 5266, Computer Laboratory, University of Cambridge, 1975.
 
14
C. J. van Rijsbergen. Evaluation, chapter 7. Butterworths, 2 edition, 1979.
 
15
16
 
17
18
 
19
Ellen M. Voorhees and Donna Harman. Overview of the seventh Text REtrieval Conference(TREC-7). In Proceedings of the Seventh Text REtrieval Conference(TREC-7), pages 1--23, 1999. NIST Special Publication 500--242.
 
20
21

CITED BY  76

Collaborative Colleagues:
Chris Buckley: colleagues
Ellen M. Voorhees: colleagues