ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Automatic acquisition of hyponyms from large text corpora
Full text Publisher SitePublisher Site PdfPdf (503 KB)
Source International Conference On Computational Linguistics archive
Proceedings of the 14th conference on Computational linguistics - Volume 2 table of contents
Nantes, France
SESSION: Large-scale resources table of contents
Pages: 539 - 545  
Year of Publication: 1992
Author
Marti A. Hearst  University of California, Berkeley Berkeley, CA
Sponsors
: Universités de Grenoble
: Conseil Général de Loire Atlantique
: IMAG
: CNRS
: SITE
: Ministère de la recherche et de la technologie
: Ministère des Affairs Étrangères
: Université de Nantes
: AFCET
: ATALA
: Ville de Nantes
: ACL
Publisher
Association for Computational Linguistics  Morristown, NJ, USA
Bibliometrics
Downloads (6 Weeks): 25,   Downloads (12 Months): 297,   Citation Count: 229
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: 10.3115/992133.992154

Warning: The download time has expired please click on the item to try again.


ABSTRACT

We describe a method for the automatic acquisition of the hyponymy lexical relation from unrestricted text. Two goals motivate the approach: (i) avoidance of the need for pre-encoded knowledge and (ii) applicability across a wide range of text. We identify a set of lexico-syntactic patterns that are easily recognizable, that occur frequently and across text genre boundaries, and that indisputably indicate the lexical relation of interest. We describe a method for discovering these patterns and suggest that other lexical relations will also be acquirable in this way. A subset of the acquisition algorithm is implemented and the results are used to augment and critique the structure of a large hand-built thesaurus. Extensions and applications to areas such as information retrieval are suggested.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
 
5
 
6
Coates-Stephens, S. (1991). Coping with lexical inadequacy - the automatic acquisition of proper nouns from news text. In The Proceedings of the 7th Annual Conference of the UW Centre for the New OED and Text Research: Using Corpora, pages 154--169, Oxford.
 
7
 
8
Grolier (1990). Academic American Encyclopedia Grolier Electronic Publishing, Danbury, Connecticut.
 
9
Hearst, M. A. (1991). Noun homograph disambiguation using local context in large text corpora. In The Proceedings of the 7th Annual Conference of the UW Centre for the New OED and Text Research: Using Corpora, Oxford.
 
10
 
11
Jacobs, P. & U. Zernik (1988). Acquiring lexical knowledge from text: A case study. In Proceedings of AAAI88, pages 739--744.
 
12
 
13
 
14
Miller, G. A., R. Beckwith, C. Fellbaum, D. Gross, & K. J. Miller (1990). Introduction to wordnet: An on-line lexical database. Journal of Lexicography, 3(4):235--244.
 
15
 
16
 
17
 
18
 
19
Wilks, Y. A., D. C. Fass, C. ming Guo, J. E. McDonald, T. Plate, & B. M. Slator (1990). Providing machine tractable dictionary tools. Journal of Machine Translation, 2.

CITED BY  229