Automatic acquisition of hyponyms from large text corpora
Source: International Conference On Computational Linguistics
Proceedings of the 14th conference on Computational linguistics - Volume 2
Nantes, France
SESSION: Large-scale resources
Pages: 539 - 545
Year of Publication: 1992
Author
Marti A. Hearst, University of California, Berkeley, Berkeley, CA
Sponsors
Universités de Grenoble
Conseil Général de Loire Atlantique
IMAG
CNRS
SITE
Ministère de la recherche et de la technologie
Ministère des Affaires Étrangères
Université de Nantes
AFCET
ATALA
Ville de Nantes
ACL
Publisher
Association for Computational Linguistics, Morristown, NJ, USA
Bibliometrics
Downloads (6 Weeks): 26, Downloads (12 Months): 238, Citation Count: 134
DOI Bookmark: 10.3115/992133.992154

ABSTRACT

We describe a method for the automatic acquisition of the hyponymy lexical relation from unrestricted text. Two goals motivate the approach: (i) avoidance of the need for pre-encoded knowledge and (ii) applicability across a wide range of text. We identify a set of lexico-syntactic patterns that are easily recognizable, that occur frequently and across text genre boundaries, and that indisputably indicate the lexical relation of interest. We describe a method for discovering these patterns and suggest that other lexical relations will also be acquirable in this way. A subset of the acquisition algorithm is implemented and the results are used to augment and critique the structure of a large hand-built thesaurus. Extensions and applications to areas such as information retrieval are suggested.
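
As a rough illustration of what a lexico-syntactic pattern for hyponymy can look like, the sketch below matches one widely known pattern of this kind, "NP such as NP, NP, ... (and|or) NP", with a regular expression. It is a minimal sketch under stated assumptions, not the paper's implementation: a noun phrase is approximated by a single word, and the names `NP`, `SUCH_AS`, and `extract_hyponyms` are hypothetical.

```python
import re

# Crude stand-in for a noun phrase: one word that is not "and"/"or".
# (Assumption for illustration; a real system would use a parser or chunker.)
NP = r"(?!(?:and|or)\b)[A-Za-z][A-Za-z-]*"

# One illustrative pattern: "NP such as NP, NP, ... (and|or) NP".
SUCH_AS = re.compile(
    rf"\b(?P<hypernym>{NP}) such as "
    rf"(?P<hyponyms>{NP}(?:, {NP})*(?:,? (?:and|or) {NP})?)"
)


def extract_hyponyms(sentence):
    """Return (hyponym, hypernym) pairs found by the 'such as' pattern."""
    pairs = []
    for match in SUCH_AS.finditer(sentence):
        hypernym = match.group("hypernym")
        # Split the enumerated list on commas and on the final and/or.
        items = re.split(r",? (?:and|or) |, ", match.group("hyponyms"))
        pairs.extend((item, hypernym) for item in items if item)
    return pairs


if __name__ == "__main__":
    text = "They play instruments such as guitars, drums, and flutes."
    for hyponym, hypernym in extract_hyponyms(text):
        print(f"hyponym({hyponym}, {hypernym})")
```

Because the pattern relies only on surface cues, it needs no pre-encoded knowledge, but the single-word noun phrase approximation will miss or truncate multi-word phrases.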


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
6. Coates-Stephens, S. (1991). Coping with lexical inadequacy - the automatic acquisition of proper nouns from news text. In The Proceedings of the 7th Annual Conference of the UW Centre for the New OED and Text Research: Using Corpora, pages 154-169, Oxford.
8. Grolier (1990). Academic American Encyclopedia. Grolier Electronic Publishing, Danbury, Connecticut.
9. Hearst, M. A. (1991). Noun homograph disambiguation using local context in large text corpora. In The Proceedings of the 7th Annual Conference of the UW Centre for the New OED and Text Research: Using Corpora, Oxford.
11. Jacobs, P. & U. Zernik (1988). Acquiring lexical knowledge from text: A case study. In Proceedings of AAAI88, pages 739-744.
14. Miller, G. A., R. Beckwith, C. Fellbaum, D. Gross, & K. J. Miller (1990). Introduction to WordNet: An on-line lexical database. Journal of Lexicography, 3(4):235-244.
19. Wilks, Y. A., D. C. Fass, C. ming Guo, J. E. McDonald, T. Plate, & B. M. Slator (1990). Providing machine tractable dictionary tools. Journal of Machine Translation, 2.

CITED BY 135
