| The effect of topic set size on retrieval experiment error |
| Full text |
Pdf
(351 KB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Tampere, Finland
SESSION: Evaluation
table of contents
Pages: 316 - 323
Year of Publication: 2002
ISBN:1-58113-561-0
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 12, Downloads (12 Months): 80, Citation Count: 41
|
|
|
ABSTRACT
Retrieval mechanisms are frequently compared by computing the respective average scores for some effectiveness metric across a common set of information needs or topics, with researchers concluding one method is superior based on those averages. Since comparative retrieval system behavior is known to be highly variable across topics, good experimental design requires that a "sufficient" number of topics be used in the test. This paper uses TREC results to empirically derive error rates based on the number of topics used in a test and the observed difference in the average scores. The error rates quantify the likelihood that a different set of topics of the same size would lead to a different conclusion. We directly compute error rates for topic sets up to size 25, and extrapolate those rates for larger topic set sizes. The error rates found are larger than anticipated, indicating researchers need to take care when concluding one method is better than another, especially if few topics are used.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
|
| |
3
|
C. W. Cleverdon, J. Mills, and E. M. Keen. Factors determining the performance of indexing systems. Two volumes, Cranfield, England, 1968.
|
| |
4
|
|
| |
5
|
M.-D. Lacasse. FUDGIT A multi-purpose data-processing and fitting program user's manual version 2.31. Technical report, Center for the Physics of Materials and Department of Physics, Montreal, Canada, April 1993.
|
| |
6
|
|
| |
7
|
K. Sparck Jones. Automatic indexing. Journal of Documentation, 30:393--432, 1974.
|
| |
8
|
K. Sparck Jones and C. van Rijsbergen. Information retrieval test collections. Journal of Documentation, 32(1):59--75, 1976.
|
| |
9
|
M. Twain. Life on the Mississippi, 1883. Reprinted at http://www.lhup.edu/~dsimanek/twain.htm.
|
| |
10
|
|
| |
11
|
|
CITED BY 41
|
|
David Carmel , Yoelle S. Maarek , Matan Mandelbrod , Yosi Mass , Aya Soffer, Searching XML documents via XML fragments, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, July 28-August 01, 2003, Toronto, Canada
|
|
|
|
|
|
Bodo Billerbeck , Falk Scholer , Hugh E. Williams , Justin Zobel, Query expansion using associated queries, Proceedings of the twelfth international conference on Information and knowledge management, November 03-08, 2003, New Orleans, LA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
W. S. Wong , R. W. P. Luk , H. V. Leong , K. S. Ho , D. L. Lee, Re-examining the effects of adding relevance information in a relevance feedback environment, Information Processing and Management: an International Journal, v.44 n.3, p.1086-1116, May, 2008
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Thomas Mandl , Christa Womser-Hacker , Giorgio Di Nunzio , Nicola Ferro, How robust are multilingual information retrieval systems?, Proceedings of the 2008 ACM symposium on Applied computing, March 16-20, 2008, Fortaleza, Ceara, Brazil
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|