ACM Home Page
Please provide us with feedback. Feedback
IR evaluation methods for retrieving highly relevant documents
Full text PdfPdf (770 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Athens, Greece
Pages: 41 - 48  
Year of Publication: 2000
ISBN:1-58113-226-3
Authors
Kalervo Järvelin  University of Tampere, Department of Information Studies, FIN-33014, University of Tampere, FINLAND
Jaana Kekäläinen  University of Tampere, Department of Information Studies, FIN-33014, University of Tampere, FINLAND
Sponsors
Athens U of Econ & Business : Athens University of Economics and Business
Greek Com Soc : Greek Computer Society
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 39,   Downloads (12 Months): 265,   Citation Count: 131
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/345508.345545
What is a DOI?

ABSTRACT

This paper proposes evaluation methods based on the use of non-dichotomous relevance judgements in IR experiments. It is argued that evaluation methods should credit IR methods for their ability to retrieve highly relevant documents. This is desirable from the user point of view in modern large IR environments. The proposed methods are (1) a novel application of P-R curves and average precision computations based on separate recall bases for documents of different degrees of relevance, and (2) two novel measures computing the cumulative gain the user obtains by examining the retrieval result up to a given ranked position. We then demonstrate the use of these evaluation methods in a case study on the effectiveness of query types, based on combinations of query structures and expansion, in retrieving documents of various degrees of relevance. The test was run with a best match retrieval system (In-Query1) in a text database consisting of newspaper articles. The results indicate that the tested strong query structures are most effective in retrieving highly relevant documents. The differences between the query types are practically essential and statistically significant. More generally, the novel evaluation methods and the case demonstrate that non-dichotomous relevance assessments are applicable in IR experiments, may reveal interesting phenomena, and allow harder testing of IR methods.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J. Allan, J. Callan, B. Croft, L. Ballesteros, J. Broglio, J. Xu & H. Shu. INQUERY at TREC 5. In E.M. Voorhees & D.K. Harrnan (Eds.), Information technology: The Fifth Text Retrieval Conference (TREC-5). Gaithersburg, MD: National Institute of Standards and Technology, 119-132, 1997.
2
3
 
4
W.J. Conover. Practical nonparametric statistics (2nd ed.). New York: John Wiley & Sons, 1980.
 
5
R. Green. The expression of conceptual syntagmatic relationships: A comparative survey. Journal of Documentation, 51(4): 315-338, 1995.
 
6
 
7
P. Ingwersen & P. Willett. An introduction to algorithmic and cognitive approaches for information retrieval. Libri, 450: 160-177, 1995.
 
8
 
9
J. Kekiliinen. The effects of query complexity, expansion and structure on retrieval performance in probabilistic text retrieval. Ph.D. dissertation. Department of Information Studies, University of Tampere, 1999.
 
10
11
 
12
 
13
 
14
S.E. Robertson & N.J. Belkin. Ranking in principle. Journal of Documentation, 34(2): 93-100, 1978.
 
15
T. Saracevic, P. Kantor, A. Chamis & D. Tnvison. A study of information seeking and retrieving. I. Background and methodology. Journal of the American Society for Information Science, 39(3): 161-176, 1988.
 
16
 
17
E. Sormunen. A Method for Measuring Wtde Range PerformancIe of Boolean Queries in Full-Text Databases. Ph.D. dssertation. Department of Information Studies, University of Tampere, 2000.
 
18

CITED BY  134

Collaborative Colleagues:
Kalervo Järvelin: colleagues
Jaana Kekäläinen: colleagues