|
ABSTRACT
Lexical ambiguity is a pervasive problem in natural language processing. However, little quantitative information is available about the extent of the problem or about the impact that it has on information retrieval systems. We report on an analysis of lexical ambiguity in information retrieval test collections and on experiments to determine the utility of word meanings for separating relevant from nonrelevant documents. The experiments show that there is considerable ambiguity even in a specialized database. Word senses provide a significant separation between relevant and nonrelevant documents, but several factors contribute to determining whether disambiguation will make an improvement in performance. For example, resolving lexical ambiguity was found to have little impact on retrieval effectiveness for documents that have many words in common with the query. Other uses of word sense disambiguation in an information retrieval context are discussed.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
ANTHONY, E. An Exploratory Inquiry into Lexical Clusters Am. Speech, 29, 3 (1954), 175-180
|
| |
3
|
ATKINS, B. Semantic ID Tags: Corpus evidence for dmtionary senses In Proceedings of the Third Annual Conference of the UW Centre for the New Oxford English Dtcttonary~ (1987), pp. 17-36.
|
| |
4
|
BECK~R, B. Sense dmambiguatlon using the the New Oxford English D~ctionary. Masters thesis, Univ. of Waterloo, 1989.
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
CHOUEKA, Y., AND LUS~C~NAN, S Discmminatlon by short contexts Comput. Hum 19, (1985), 147-157.
|
| |
9
|
COTTRELL, G., AND SMALL, S. A connectiomst scheme for modeling word sense disambiguation. Cognition and Bra~n Theory, 6, I (1983), 89-120.
|
| |
10
|
CROFT~ W B. Experiments with Representation in a Document Retmeval System Inf. Tech. Res. Dev. 2, (1983), 1-21.
|
| |
11
|
CULLINGFORD, R., AND PAZZANI, M. Word-meaning selection in multiprocess language understanding programs. IEEE Trans Patter, Anal Moch Intell 6, 4 (1984), 493-509
|
| |
12
|
|
| |
13
|
EARL, L. Use of word government in resolving syntactic and semantic ambiguities. Inf. Storage Retrieval. 9, (1973), 639-664.
|
 |
14
|
|
| |
15
|
|
| |
16
|
HAYES, P. Some association-based techniques for texical disambiguation by machine Ph.D Disseration, published as Tech. Rep. 25, Dept. of Computer Science, Univ. of Rochester, 1977.
|
| |
17
|
HELM, S. Closer than you think. Medicine Comput. 1, i (1983)
|
| |
18
|
HmsT, G Resolving lexical ambiguity computationally with spreading activation and polaroid words. In Lexical Ambiguity Resolution, S. Small, G. Cottrell, and M Tannenhaus Eds., Morgan Kaufmann, Pato Alto, Calif., 1988.
|
| |
19
|
KEEN, E. An analysis of the documentation requests. In The Smart Retrteval System, G Salton, Ed., Prentice-Hall, Englewood, N.J, 1971.
|
| |
20
|
KELLY, E., AND STONE, P. Computer Recogn~twn of English Word Senses. North-Holland, Amsterdam, 1975
|
| |
21
|
KROVETZ, R. Lexical acquisition and information retrieval. In Lexical Acquisition' Build~ ~ng the Lexzcon using On-Line Resources, U. Zernik Ed., LEA Press~ 1991.
|
 |
22
|
|
 |
23
|
|
| |
24
|
MASTERMAN, M., NEEDHAM, R. M., SPARCK-JONES, K., AND MAYOH, B. Agricola incurvo terram dlmovit aratro. Rep. ML84, Cambridge Language Research Unit, 1957, Reprinted 1986.
|
| |
25
|
PROCTOR, P Longman Dictzonali~ of Contemporary English. Longman, 1978.
|
| |
26
|
|
| |
27
|
|
| |
28
|
SEDLOW, S., AND MOONEY, D. Knowledge retrieval from expert systems: II. Research results In Proceedings of the 51st Annual Meeting of the American Society of Information Science, (1988), pp. 209-212.
|
| |
29
|
|
| |
30
|
SMALL, S., AND RINGER, C. Parsing and comprehending with word experts (a theory and its realization). In Strategies for Natural Language Processing, Lehnert and Ringle Eds. LEA Press, 1982, 89-147.
|
| |
31
|
|
| |
32
|
SPARCK-JONES, K., AND TAIT, J. Automatic search term variant generation. J. Doc. 40, 1 (1984), 50-66.
|
| |
33
|
|
| |
34
|
WEISS, S Learning to disambiguate. Inf. Storage Retrieval, 9, (1973), 33-41.
|
| |
35
|
Y. Wilks , D. Fass , C. Guo , J. McDonald , T. Plate , B. Slator, A tractable machine dictionary as a resource for computational semantics, Computational lexicography for natural language processing, Longman Publishing Group, White Plains, NY, 1989
|
| |
36
|
ZIPF, G. The meaning-frequency relationship of words. J. Gen Psycho 33, (1945), 251 266.
|
CITED BY 51
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Robert B. Allen , Pascal Obry , Michael Littman, An interface for navigating clustered document sets returned by queries, Proceedings of the conference on Organizational computing systems, p.166-171, November 01-04, 1993, Milpitas, California, United States
|
|
|
|
|
|
W. Bruce Croft , Howard R. Turtle , David D. Lewis, The use of phrases and structured queries in information retrieval, Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval, p.32-45, October 13-16, 1991, Chicago, Illinois, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Anthony Ventresque , Sylvie Cazalens , Philippe Lamarre , Patrick Valduriez, Dealing with P2P semantic heterogeneity through query expansion and interpretation, Proceedings of the 2008 international workshop on Data management in peer-to-peer systems, p.3-10, March 25-25, 2008, Nantes, France
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
REVIEW
"Richard S. Marcus : Reviewer"
The authors consider ambiguity arising from words having multiple
senses. For example, the word “file” has the two senses,
“a thing that stores information” and “a thing that
scrapes wood or metal.” The aut
more...
|