| An experimental study of factors important in document ranking |
| Full text |
Pdf
(809 KB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Palazzo dei Congressi, Pisa, Italy
Pages: 186 - 193
Year of Publication: 1986
ISBN:0-89791-187-3
|
|
Author
|
|
Donna Williamson Harman
|
Lister Hill National Center for Biomedical Communications, National Library of Medicine, Bethesda, Maryland
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 8, Downloads (12 Months): 46, Citation Count: 12
|
|
|
ABSTRACT
The ability to effectively rank retrieved documents in order of their probable relevance to a query is a critical factor in statistically-based keyword retrieval systems. This paper summarizes a set of experiments with different methods of term weighting for documents, using measures of term importance within an entire document collection, term importance within a given document, and document length. It is shown that significant improvements over no term weighting can be made using a combination of weighting measures and normalizing for document length.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
CROFT79
|
Croft W.B., Harper D.I., "Using Probabilistic Models of Document R~tricval Without Relevance Information", Journal of Documentation, Vol. 35, No. 4, #mbe.r 1979, pp. 285-295.
|
| |
CROFT82
|
Croft W.B., "Experiments with Roprcscntation in a Document Retrieval System", COIN Technical Report 82-21, May 1982.
|
| |
DENNIS64
|
Dennis S.F., "The Construction of a Thesaurus Automat/cally from a Sample of Tcxt",Symposium Proc#gs, Statistical Association Methods for Mecb# Documentation, 1964. (National Bureau of Standards Miscellaneous Publication 269).
|
| |
HARTER75
|
Haner $.P., "A Probabilisti~ Approach to Automatic Keyword Indexing", Journal of the Amesican Society for Information Science, VoL 26, No. 5, Octobe# 1973, pp. 280-289.
|
| |
MCGILL79
|
McGiU M., Koll M., Noreault T., "An Evaluation of Factors Affecting Document Ranking by Information Retrieval Systems". Report, School of Inforration Smd/es, Syracuse University, Syracuse, New York, October 1979.
|
 |
RAGHAVEN83
|
|
| |
SALTON73
|
Salton G., Yang C.S., "On the Specilica#on of Term Values in Automti~ hdcxing", Journal of Documentation, Vol. 29, No. 4, December 1973, pp. 351-372.
|
| |
SALTON83
|
|
| |
SPARCK72
|
Sparck Jones K., "A Statistical Intmtn#tation of Term Specificity and Its Application in Retrieval", Your. hal of D ocungntation, Vol. 28, No. 1, March 1972, pp. 11- 20.
|
 |
YU82
|
|
|