|
ABSTRACT
This paper describes an automatic indexing procedure that uses the “IS-A” relations contained within WordNet and the set of nouns contained in a text to select a sense for each plysemous noun in the text. The result of the indexing procedure is a vector in which some of the terms represent word senses instead of word stems. Retrieval experiments comparing the effectivenss of these sense-based vectors vs. stem-based vectors show the stem-based vectors to be superior overall, although the sense-based vectors do improve the performance of some queries. The overall degradation is due in large part to the difficulty of disambiguating senses in short query statements. An analysis of these results suggests two conclusions: the IS-A links define a generalization/specialization hierarchy that is not sufficient to reliably select the correct sense of a noun from the set of fine sense distinctions in WordNet; and missing correct matches because of incorrect sense resolution has a much more deleterious effect on retrieval performance than does making spurious matches.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
 |
3
|
|
| |
4
|
George Miller. Special Issue, WordNet: An on-line lexical database. International Journal of Lexicography, 3(4)~ 1990.
|
| |
5
|
|
| |
6
|
Gerard Salton and Michael E. Lesk. Information analysis and dictionary construction. In Gerard Salton, editor, The SMART Retrieval System: Experiments in Automatic Document Processing, chapter 6, pages 115-142. Prentice-Hall, Inc. Englewood Cliffs, New Jersey, 1971.
|
| |
7
|
Sally Yeates Sedelow and Donna Weir Mooney. Knowledge retrieval from domaintranscendent expert systems: II. research results. In Proceedings of the 51st Annual Meeting of the American Society of Information Science, pages 209-212, 1988.
|
| |
8
|
Brian Michael Slator. Lexical Semantics and Preference Semantics Analysis. PhD thesis, New Mexico State University, Las Cruces, NM, December 1988.
|
| |
9
|
Ellen M. Voorhees and Yuan-Wang Hou. Vector expansion in a large collection. In Proceedings of the First Text Retmeval Conference, 1992. Proceedings to appear.
|
| |
10
|
Ellen M. Voorhees, Claudia Leacock, and Geoffrey Towell. Learning context to disambiguate word senses. In Proceedings of the 3rd Computational Learning Theory and Natural Learning Systems Conference, 1992. Proceedings to appear. Also available as Siemens technical report.
|
| |
11
|
G.K. Zipf. The meaning-frequency relationship of words. Journal of General Psychology, 3:251-256, 1945.
|
CITED BY 65
|
|
|
|
|
|
|
|
Rila Mandala , Takenobu Tokunaga , Hozumi Tanaka, Combining multiple evidence from different types of thesaurus for query expansion, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, p.191-197, August 15-19, 1999, Berkeley, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
Yariv Aridor , David Carmel , Yoëlle S. Maarek , Aya Soffer , Ronny Lempel, Knowledge encapsulation for focused search from pervasive devices, Proceedings of the 10th international conference on World Wide Web, p.754-764, May 01-05, 2001, Hong Kong, Hong Kong
|
|
|
|
|
|
Makoto Nakashima , Keizo Sato , Yanhua Qu , Tetsuro Ito, Browsing-based conceptual information retrieval incorporating dictionary term relations, keyword association, and a user's interest, Journal of the American Society for Information Science and Technology, v.54 n.1, p.16-28, January 2003
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Troels Andreasen , Per Anker Jensen , Jørgen Fischer Nilsson , Patrizia Paggio , Bolette Sandford Pedersen , Hanne Erdman Thomsen, Content-based text querying with ontological descriptors, Data & Knowledge Engineering, v.48 n.2, p.199-219, February 2004
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
William A. Woods , Lawrence A. Bookman , Ann Houston , Robert J. Kuhns , Paul Martin , Stephen Green, Linguistic knowledge can improve information retrieval, Proceedings of the sixth conference on Applied natural language processing, p.262-267, April 29-May 04, 2000, Seattle, Washington
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jennifer Chu-Carroll , John Prager , Krzysztof Czuba , David Ferrucci , Pablo Duboue, Semantic search via XML fragments: a high-precision approach to IR, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, August 06-11, 2006, Seattle, Washington, USA
|
|
|
|
|
|
|
|
|
Ido Dagan , Oren Glickman , Alfio Gliozzo , Efrat Marmorshtein , Carlo Strapparava, Direct word sense matching for lexical substitution, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, p.449-456, July 17-18, 2006, Sydney, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Soumen Chakrabarti , Byron Dom , Rakesh Agrawal , Prabhakar Raghavan, Using Taxonomy, Discriminants, and Signatures for Navigating in Text Databases, Proceedings of the 23rd International Conference on Very Large Data Bases, p.446-455, August 25-29, 1997
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Xiaobin Li , Stan Szpakowicz , Stan Matwin, A WordNet-based algorithm for word sense disambiguation, Proceedings of the 14th international joint conference on Artificial intelligence, p.1368-1374, August 20-25, 1995, Montreal, Quebec, Canada
|
|