ACM Home Page
Please provide us with feedback. Feedback
Using WordNet to disambiguate word senses for text retrieval
Full text PdfPdf (898 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Pittsburgh, Pennsylvania, United States
Pages: 171 - 180  
Year of Publication: 1993
ISBN:0-89791-605-0
Author
Sponsor
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 21,   Downloads (12 Months): 115,   Citation Count: 65
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/160688.160715
What is a DOI?

ABSTRACT

This paper describes an automatic indexing procedure that uses the “IS-A” relations contained within WordNet and the set of nouns contained in a text to select a sense for each plysemous noun in the text. The result of the indexing procedure is a vector in which some of the terms represent word senses instead of word stems. Retrieval experiments comparing the effectivenss of these sense-based vectors vs. stem-based vectors show the stem-based vectors to be superior overall, although the sense-based vectors do improve the performance of some queries. The overall degradation is due in large part to the difficulty of disambiguating senses in short query statements. An analysis of these results suggests two conclusions: the IS-A links define a generalization/specialization hierarchy that is not sufficient to reliably select the correct sense of a noun from the set of fine sense distinctions in WordNet; and missing correct matches because of incorrect sense resolution has a much more deleterious effect on retrieval performance than does making spurious matches.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
3
 
4
George Miller. Special Issue, WordNet: An on-line lexical database. International Journal of Lexicography, 3(4)~ 1990.
 
5
 
6
Gerard Salton and Michael E. Lesk. Information analysis and dictionary construction. In Gerard Salton, editor, The SMART Retrieval System: Experiments in Automatic Document Processing, chapter 6, pages 115-142. Prentice-Hall, Inc. Englewood Cliffs, New Jersey, 1971.
 
7
Sally Yeates Sedelow and Donna Weir Mooney. Knowledge retrieval from domaintranscendent expert systems: II. research results. In Proceedings of the 51st Annual Meeting of the American Society of Information Science, pages 209-212, 1988.
 
8
Brian Michael Slator. Lexical Semantics and Preference Semantics Analysis. PhD thesis, New Mexico State University, Las Cruces, NM, December 1988.
 
9
Ellen M. Voorhees and Yuan-Wang Hou. Vector expansion in a large collection. In Proceedings of the First Text Retmeval Conference, 1992. Proceedings to appear.
 
10
Ellen M. Voorhees, Claudia Leacock, and Geoffrey Towell. Learning context to disambiguate word senses. In Proceedings of the 3rd Computational Learning Theory and Natural Learning Systems Conference, 1992. Proceedings to appear. Also available as Siemens technical report.
 
11
G.K. Zipf. The meaning-frequency relationship of words. Journal of General Psychology, 3:251-256, 1945.

CITED BY  65