| Do batch and user evaluations give the same results? |
| Full text |
Pdf
(729 KB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Athens, Greece
Pages: 17 - 24
Year of Publication: 2000
ISBN:1-58113-226-3
|
|
Authors
|
|
William Hersh
|
Division of Medical Informatics & Outcomes Research, Oregon Health Sciences University, Portland, OR
|
|
Andrew Turpin
|
Division of Medical Informatics & Outcomes Research, Oregon Health Sciences University, Portland, OR
|
|
Susan Price
|
Division of Medical Informatics & Outcomes Research, Oregon Health Sciences University, Portland, OR
|
|
Benjamin Chan
|
Division of Medical Informatics & Outcomes Research, Oregon Health Sciences University, Portland, OR
|
|
Dale Kramer
|
Division of Medical Informatics & Outcomes Research, Oregon Health Sciences University, Portland, OR
|
|
Lynetta Sacherek
|
Division of Medical Informatics & Outcomes Research, Oregon Health Sciences University, Portland, OR
|
|
Daniel Olson
|
Division of Medical Informatics & Outcomes Research, Oregon Health Sciences University, Portland, OR
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 64, Citation Count: 22
|
|
|
ABSTRACT
Do improvements in system performance demonstrated by batch evaluations confer the same benefit for real users? We carried out experiments designed to investigate this question. After identifying a weighting scheme that gave maximum improvement over the baseline in a non-interactive evaluation, we used it with real users searching on an instance recall task. Our results showed the weighting scheme giving beneficial results in batch studies did not do so with real users. Further analysis did identify other factors predictive of instance recall, including number of documents saved by the user, document recall, and number of documents seen by the user.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
John P. Chin , Virginia A. Diehl , Kent L. Norman, Development of an instrument measuring user satisfaction of the human-computer interface, Proceedings of the SIGCHI conference on Human factors in computing systems, p.213-218, May 15-19, 1988, Washington, D.C., United States
[doi> 10.1145/57167.57203]
|
| |
2
|
C. Cleverdon and E. Keen, Factors determining the performance of indexin systems, Cranfield UK: Aslib Cranfield Research Project 1966.
|
 |
3
|
|
| |
4
|
|
 |
5
|
|
| |
6
|
C. Meadow, Relevance?, Journal of the American Society for Information Science, 36: 354-355, 1985.
|
| |
7
|
|
 |
8
|
|
| |
9
|
|
| |
10
|
D. Swanson, Information retrieval as a trial-anderror process, Library Quarterly, 47: 128-148, 1977.
|
| |
11
|
|
 |
12
|
|
CITED BY 22
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Mingfang Wu , Gheorghe Muresan , Alistair McLean , Muh-Chyun (Morris) Tang , Ross Wilkinson , Yuelin Li , Hyuk-Jin Lee , Nichloas J. Belkin, Human versus machine in the topic distillation task, Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, July 25-29, 2004, Sheffield, United Kingdom
|
|
|
|
|
|
|
|
|
Jimmy Lin , Philip Wu , Dina Demner-Fushman , Eileen Abels, Exploring the limits of single-iteration clarification dialogs, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, August 06-11, 2006, Seattle, Washington, USA
|
|
|
|
|
|
|
|
|
|
|
|
Nina Wacholder , Diane Kelly , Paul Kantor , Robert Rittman , Ying Sun , Bing Bai , Sharon Small , Boris Yamrom , Tomek Strzalkowski, A model for quantitative evaluation of an end-to-end question-answering system, Journal of the American Society for Information Science and Technology, v.58 n.8, p.1082-1099, June 2007
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Andrew Turpin , Falk Scholer , Kalvero Jarvelin , Mingfang Wu , J. Shane Culpepper, Including summaries in system evaluation, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
|
|
|
Susan L. Price , Marianne Lykke Nielsen , Lois M. L. Delcambre , Peter Vedsted , Jeremy Steinhauer, Using semantic components to search for domain-specific documents: An evaluation from the system perspective and the user perspective, Information Systems, v.34 n.8, p.778-806, December, 2009
|
|
|
|
|