ACM Home Page
Please provide us with feedback. Feedback
An experimental study of factors important in document ranking
Full text PdfPdf (809 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Palazzo dei Congressi, Pisa, Italy
Pages: 186 - 193  
Year of Publication: 1986
ISBN:0-89791-187-3
Author
Donna Williamson Harman  Lister Hill National Center for Biomedical Communications, National Library of Medicine, Bethesda, Maryland
Sponsor
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 46,   Citation Count: 12
Additional Information:

abstract   references   cited by  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/253168.253206
What is a DOI?

ABSTRACT

The ability to effectively rank retrieved documents in order of their probable relevance to a query is a critical factor in statistically-based keyword retrieval systems. This paper summarizes a set of experiments with different methods of term weighting for documents, using measures of term importance within an entire document collection, term importance within a given document, and document length. It is shown that significant improvements over no term weighting can be made using a combination of weighting measures and normalizing for document length.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
CROFT79
Croft W.B., Harper D.I., "Using Probabilistic Models of Document R~tricval Without Relevance Information", Journal of Documentation, Vol. 35, No. 4, #mbe.r 1979, pp. 285-295.
 
CROFT82
Croft W.B., "Experiments with Roprcscntation in a Document Retrieval System", COIN Technical Report 82-21, May 1982.
 
DENNIS64
Dennis S.F., "The Construction of a Thesaurus Automat/cally from a Sample of Tcxt",Symposium Proc#gs, Statistical Association Methods for Mecb# Documentation, 1964. (National Bureau of Standards Miscellaneous Publication 269).
 
HARTER75
Haner $.P., "A Probabilisti~ Approach to Automatic Keyword Indexing", Journal of the Amesican Society for Information Science, VoL 26, No. 5, Octobe# 1973, pp. 280-289.
 
MCGILL79
McGiU M., Koll M., Noreault T., "An Evaluation of Factors Affecting Document Ranking by Information Retrieval Systems". Report, School of Inforration Smd/es, Syracuse University, Syracuse, New York, October 1979.
RAGHAVEN83
 
SALTON73
Salton G., Yang C.S., "On the Specilica#on of Term Values in Automti~ hdcxing", Journal of Documentation, Vol. 29, No. 4, December 1973, pp. 351-372.
 
SALTON83
 
SPARCK72
Sparck Jones K., "A Statistical Intmtn#tation of Term Specificity and Its Application in Retrieval", Your. hal of D ocungntation, Vol. 28, No. 1, March 1972, pp. 11- 20.
YU82

CITED BY  12