ACM Home Page
Please provide us with feedback. Feedback
Lexical ambiguity and information retrieval
Full text PdfPdf (2.00 MB)
Source ACM Transactions on Information Systems (TOIS) archive
Volume 10 ,  Issue 2  (April 1992) table of contents
Pages: 115 - 141  
Year of Publication: 1992
ISSN:1046-8188
Authors
Robert Krovetz  Univ. of Massachusetts, Amherst
W. Bruce Croft  Univ. of Massachusetts, Amherst
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 95,   Citation Count: 51
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/146802.146810
What is a DOI?

ABSTRACT

Lexical ambiguity is a pervasive problem in natural language processing. However, little quantitative information is available about the extent of the problem or about the impact that it has on information retrieval systems. We report on an analysis of lexical ambiguity in information retrieval test collections and on experiments to determine the utility of word meanings for separating relevant from nonrelevant documents. The experiments show that there is considerable ambiguity even in a specialized database. Word senses provide a significant separation between relevant and nonrelevant documents, but several factors contribute to determining whether disambiguation will make an improvement in performance. For example, resolving lexical ambiguity was found to have little impact on retrieval effectiveness for documents that have many words in common with the query. Other uses of word sense disambiguation in an information retrieval context are discussed.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
ANTHONY, E. An Exploratory Inquiry into Lexical Clusters Am. Speech, 29, 3 (1954), 175-180
 
3
ATKINS, B. Semantic ID Tags: Corpus evidence for dmtionary senses In Proceedings of the Third Annual Conference of the UW Centre for the New Oxford English Dtcttonary~ (1987), pp. 17-36.
 
4
BECK~R, B. Sense dmambiguatlon using the the New Oxford English D~ctionary. Masters thesis, Univ. of Waterloo, 1989.
 
5
 
6
 
7
 
8
CHOUEKA, Y., AND LUS~C~NAN, S Discmminatlon by short contexts Comput. Hum 19, (1985), 147-157.
 
9
COTTRELL, G., AND SMALL, S. A connectiomst scheme for modeling word sense disambiguation. Cognition and Bra~n Theory, 6, I (1983), 89-120.
 
10
CROFT~ W B. Experiments with Representation in a Document Retmeval System Inf. Tech. Res. Dev. 2, (1983), 1-21.
 
11
CULLINGFORD, R., AND PAZZANI, M. Word-meaning selection in multiprocess language understanding programs. IEEE Trans Patter, Anal Moch Intell 6, 4 (1984), 493-509
 
12
 
13
EARL, L. Use of word government in resolving syntactic and semantic ambiguities. Inf. Storage Retrieval. 9, (1973), 639-664.
14
 
15
 
16
HAYES, P. Some association-based techniques for texical disambiguation by machine Ph.D Disseration, published as Tech. Rep. 25, Dept. of Computer Science, Univ. of Rochester, 1977.
 
17
HELM, S. Closer than you think. Medicine Comput. 1, i (1983)
 
18
HmsT, G Resolving lexical ambiguity computationally with spreading activation and polaroid words. In Lexical Ambiguity Resolution, S. Small, G. Cottrell, and M Tannenhaus Eds., Morgan Kaufmann, Pato Alto, Calif., 1988.
 
19
KEEN, E. An analysis of the documentation requests. In The Smart Retrteval System, G Salton, Ed., Prentice-Hall, Englewood, N.J, 1971.
 
20
KELLY, E., AND STONE, P. Computer Recogn~twn of English Word Senses. North-Holland, Amsterdam, 1975
 
21
KROVETZ, R. Lexical acquisition and information retrieval. In Lexical Acquisition' Build~ ~ng the Lexzcon using On-Line Resources, U. Zernik Ed., LEA Press~ 1991.
22
23
 
24
MASTERMAN, M., NEEDHAM, R. M., SPARCK-JONES, K., AND MAYOH, B. Agricola incurvo terram dlmovit aratro. Rep. ML84, Cambridge Language Research Unit, 1957, Reprinted 1986.
 
25
PROCTOR, P Longman Dictzonali~ of Contemporary English. Longman, 1978.
 
26
 
27
 
28
SEDLOW, S., AND MOONEY, D. Knowledge retrieval from expert systems: II. Research results In Proceedings of the 51st Annual Meeting of the American Society of Information Science, (1988), pp. 209-212.
 
29
 
30
SMALL, S., AND RINGER, C. Parsing and comprehending with word experts (a theory and its realization). In Strategies for Natural Language Processing, Lehnert and Ringle Eds. LEA Press, 1982, 89-147.
 
31
 
32
SPARCK-JONES, K., AND TAIT, J. Automatic search term variant generation. J. Doc. 40, 1 (1984), 50-66.
 
33
 
34
WEISS, S Learning to disambiguate. Inf. Storage Retrieval, 9, (1973), 33-41.
 
35
 
36
ZIPF, G. The meaning-frequency relationship of words. J. Gen Psycho 33, (1945), 251 266.

CITED BY  51


REVIEW

"Richard S. Marcus : Reviewer"

The authors consider ambiguity arising from words having multiple senses. For example, the word “file” has the two senses, “a thing that stores information” and “a thing that scrapes wood or metal.” The aut  more...

Collaborative Colleagues:
Robert Krovetz: colleagues
W. Bruce Croft: colleagues