ACM Home Page
Please provide us with feedback. Feedback
One sense per discourse
Full text PdfPdf (472 KB)
Source Human Language Technology Conference archive
Proceedings of the workshop on Speech and Natural Language table of contents
Harriman, New York
SESSION: Lexicon and lexical semantics table of contents
Pages: 233 - 237  
Year of Publication: 1992
ISBN:1-55860-272-0
Authors
William A. Gale  AT&T Bell Laboratories, Murray Hill NJ
Kenneth W. Church  AT&T Bell Laboratories, Murray Hill NJ
David Yarowsky  AT&T Bell Laboratories, Murray Hill NJ
Publisher
Association for Computational Linguistics  Morristown, NJ, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 43,   Citation Count: 63
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: 10.3115/1075527.1075579

ABSTRACT

It is well-known that there are polysemous words like sentence whose "meaning" or "sense" depends on the context of use. We have recently reported on two new word-sense disambiguation systems, one trained on bilingual material (the Canadian Hansards) and the other trained on monolingual material (Roget's Thesaurus and Grolier's Encyclopedia). As this work was nearing completion, we observed a very strong discourse effect. That is, if a polysemous word such as sentence appears two or more times in a well-written discourse, it is extremely likely that they will all share the same sense. This paper describes an experiment which confirmed this hypothesis and found that the tendency to share sense in the same discourse is extremely strong (98%). This result can be used as an additional source of constraint for improving the performance of the word-sense disambiguation algorithm. In addition, it could also be used to help evaluate disambiguation algorithms that did not make use of the discourse constraint.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
Chapman, Robert (1977). Roget's International Thesaurus (Fourth Edition), Harper and Row, New York.
 
4
 
5
Gale, Church, and Yarowsky, 1992, "Discrimination Decisions for 100,000-Dimensional Spaces" AT&T Statistical Research Report No. 103.
 
6
Grolier's Inc. (1991) New Grolier's Electronic Encyclopedia.
 
7
 
8
Kelly, Edward, and Phillip Stone (1975), Computer Recognition of English Word Senses, North-Holland, Amsterdam.
 
9
Mosteller, Fredrick, and David Wallace (1964) Inference and Disputed Authorship: The Federalist, Addison-Wesley, Reading, Massachusetts.
 
10
 
11

CITED BY  63
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
Collaborative Colleagues:
William A. Gale: colleagues
Kenneth W. Church: colleagues
David Yarowsky: colleagues