ACM Home Page
Please provide us with feedback. Feedback
Dynamic test collections: measuring search effectiveness on the live web
Full text PdfPdf (380 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Seattle, Washington, USA
SESSION: Evaluation 1--user models and test collections table of contents
Pages: 276 - 283  
Year of Publication: 2006
ISBN:1-59593-369-7
Author
Ian Soboroff  National Institute of Standards and Technology, Gaithersburg, MD
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 86,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1148170.1148220
What is a DOI?

ABSTRACT

Existing methods for measuring the quality of search algorithms use a static collection of documents. A set of queries and a mapping from the queries to the relevant documents allow the experimenter to see how well different search engines or engine configurations retrieve the correct answers. This methodology assumes that the document set and thus the set of relevant documents are unchanging. In this paper, we abandon the static collection requirement. We begin with a recent TREC collection created from a web crawl and analyze how the documents in that collection have changed over time. We determine how decay of the document collection affects TREC systems, and present the results of an experiment using the decayed collection to measure a live web search system. We employ novel measures of search effectiveness that are robust despite incomplete relevance information. Lastly, we propose a methodology of "collection maintenance" which supports measuring search performance both for a single system and between systems run at different points in time.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '98), Melbourne, Australia, August 1998. ACM Press.
 
2
Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2004), Sheffield, UK, July 2004. ACM Press.
3
 
4
Yaniv Bernstein and Justin Zobel. A scalable system for identifying co-derivative documents. In Proceedings of the Eleventh Symposium on String Processing and Information Retrieval (SPIRE 2004), Padova, Italy, October 2004.
 
5
 
6
7
8
 
9
10
 
11
Charles L. A. Clarke, Ian Soboroff, and Nick Craswell. Overview of the TREC 2004 terabyte track. In E. M. Voorhees and L. P. Buckland, editors, Proceedings of the Thirteenth Text REtrieval Conference (TREC 2004), Gaithersburg, MD, November 2004.
 
12
C. W. Cleverdon. The cranfield tests on index langauge devices. In Aslib Proceedings, volume 19, pages 173--192, 1967.
13
 
14
15
16
17
18
19
20
 
21
Ellen M. Voorhees and Donna~K. Harman, editors. TREC: Experiments in Information Retrieval Evaluation. MIT Press, 2005.
22