|
Warning: The download time has expired please click on the item to try again.
ABSTRACT
We describe a method for the automatic acquisition of the hyponymy lexical relation from unrestricted text. Two goals motivate the approach: (i) avoidance of the need for pre-encoded knowledge and (ii) applicability across a wide range of text. We identify a set of lexico-syntactic patterns that are easily recognizable, that occur frequently and across text genre boundaries, and that indisputably indicate the lexical relation of interest. We describe a method for discovering these patterns and suggest that other lexical relations will also be acquirable in this way. A subset of the acquisition algorithm is implemented and the results are used to augment and critique the structure of a large hand-built thesaurus. Extensions and applications to areas such as information retrieval are suggested.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
| |
5
|
|
| |
6
|
Coates-Stephens, S. (1991). Coping with lexical inadequacy - the automatic acquisition of proper nouns from news text. In The Proceedings of the 7th Annual Conference of the UW Centre for the New OED and Text Research: Using Corpora, pages 154--169, Oxford.
|
| |
7
|
|
| |
8
|
Grolier (1990). Academic American Encyclopedia Grolier Electronic Publishing, Danbury, Connecticut.
|
| |
9
|
Hearst, M. A. (1991). Noun homograph disambiguation using local context in large text corpora. In The Proceedings of the 7th Annual Conference of the UW Centre for the New OED and Text Research: Using Corpora, Oxford.
|
| |
10
|
|
| |
11
|
Jacobs, P. & U. Zernik (1988). Acquiring lexical knowledge from text: A case study. In Proceedings of AAAI88, pages 739--744.
|
| |
12
|
|
| |
13
|
|
| |
14
|
Miller, G. A., R. Beckwith, C. Fellbaum, D. Gross, & K. J. Miller (1990). Introduction to wordnet: An on-line lexical database. Journal of Lexicography, 3(4):235--244.
|
| |
15
|
|
| |
16
|
|
| |
17
|
|
| |
18
|
|
| |
19
|
Wilks, Y. A., D. C. Fass, C. ming Guo, J. E. McDonald, T. Plate, & B. M. Slator (1990). Providing machine tractable dictionary tools. Journal of Machine Translation, 2.
|
CITED BY 229
|
|
|
|
|
Joanna Golebiowska , Rose Dieng-Kuntz , Olivier Corby , Didier Mousseau, Building and exploiting ontologies for an automobile project memory, Proceedings of the 1st international conference on Knowledge capture, October 22-23, 2001, Victoria, British Columbia, Canada
|
|
|
|
|
|
|
|
|
|
|
|
Leah S. Larkey , Paul Ogilvie , M. Andrew Price , Brenden Tamilio, Acrophile: an automated acronym extractor and server, Proceedings of the fifth ACM conference on Digital libraries, p.205-214, June 02-07, 2000, San Antonio, Texas, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Agneta Bergström , Patricija Jaksetic , Peter Nordin, Enhancing information retrieval by automatic acquisition of textual relations using genetic programming, Proceedings of the 5th international conference on Intelligent user interfaces, p.29-32, January 09-12, 2000, New Orleans, Louisiana, United States
|
|
|
Oren Etzioni , Michael Cafarella , Doug Downey , Stanley Kok , Ana-Maria Popescu , Tal Shaked , Stephen Soderland , Daniel S. Weld , Alexander Yates, Web-scale information extraction in knowitall: (preliminary results), Proceedings of the 13th international conference on World Wide Web, May 17-20, 2004, New York, NY, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Hiroyuki Kaji , Yasutsugu Morimoto , Toshiko Aizono , Noriyuki Yamasaki, Corpus-dependent association thesauri for information retrieval, Proceedings of the 18th conference on Computational linguistics, p.404-410, July 31-August 04, 2000, Saarbrücken, Germany
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Oren Etzioni , Michael Cafarella , Doug Downey , Ana-Maria Popescu , Tal Shaked , Stephen Soderland , Daniel S. Weld , Alexander Yates, Unsupervised named-entity extraction from the web: an experimental study, Artificial Intelligence, v.165 n.1, p.91-134, June 2005
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Eric Glover , David M. Pennock , Steve Lawrence , Robert Krovetz, Inferring hierarchical descriptions, Proceedings of the eleventh international conference on Information and knowledge management, November 04-09, 2002, McLean, Virginia, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Michael J. Cafarella , Doug Downey , Stephen Soderland , Oren Etzioni, KnowItNow: fast, scalable information extraction from the web, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, p.563-570, October 06-08, 2005, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|
|
Dmitri Roussinov , Leon J. Zhao , Weiguo Fan, Mining context specific similarity relationships using the world wide web, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, p.499-506, October 06-08, 2005, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ben Carterette , Rosie Jones , Wiley Greiner , Cory Barr, N semantic classes are harder than two, Proceedings of the COLING/ACL on Main conference poster sessions, p.49-56, July 17-18, 2006, Sydney, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Enrique Alfonseca , Pablo Castells , Manabu Okumura , Maria Ruiz-Casado, A rote extractor with edit distance-based generalisation and multi-corpora precision calculation, Proceedings of the COLING/ACL on Main conference poster sessions, p.9-16, July 17-18, 2006, Sydney, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Risto Gligorov , Warner ten Kate , Zharko Aleksovski , Frank van Harmelen, Using Google distance to weight approximate ontology matches, Proceedings of the 16th international conference on World Wide Web, May 08-12, 2007, Banff, Alberta, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Saurav Sahay , Sougata Mukherjea , Eugene Agichtein , Ernest V. Garcia , Shamkant B. Navathe , Ashwin Ram, Discovering semantic biomedical relations utilizing the Web, ACM Transactions on Knowledge Discovery from Data (TKDD), v.2 n.1, p.1-15, March 2008
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Dou Shen , Min Qin , Weizhu Chen , Qiang Yang , Zheng Chen, Mining web query hierarchies from clickthrough data, Proceedings of the 22nd national conference on Artificial intelligence, p.341-346, July 22-26, 2007, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Tomas Kliegr , Krishna Chandramouli , Jan Nemrava , Vojtech Svatek , Ebroul Izquierdo, Combining image captions and visual analysis for image concept classification, Proceedings of the 9th International Workshop on Multimedia Data Mining: held in conjunction with the ACM SIGKDD 2008, p.8-17, August 24-24, 2008, Las Vegas, Nevada
|
|
|
|
|
|
Oren Etzioni , Michael Cafarella , Doug Downey , Ana-Maria Popescu , Tal Shaked , Stephen Soderland , Daniel S. Weld , Alexander Yates, Methods for domain-independent information extraction from the web: an experimental comparison, Proceedings of the 19th national conference on Artifical intelligence, p.391-398, July 25-29, 2004, San Jose, California
|
|
|
|
|
|
|
|
|
|
|
|
Vivi Nastase , Jelber Sayyad-Shirabad , Marina Sokolova , Stan Szpakowicz, Learning noun-modifier semantic relations with corpus-based and WordNet-based features, Proceedings of the 21st national conference on Artificial intelligence, p.781-786, July 16-20, 2006, Boston, Massachusetts
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Adam Przepiórkowski , Łukasz Degórski , Beata Wójtowicz , Miroslav Spousta , Vladislav Kuboň , Kiril Simov , Petya Osenova , Lothar Lemnitzer, Towards the automatic extraction of definitions in Slavic, Proceedings of the Workshop on Balto-Slavonic Natural Language Processing: Information Extraction and Enabling Technologies, June 29-29, 2007, Prague, Czech Republic
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Leonardo Rigutini , Ernesto Di Iorio , Marco Ernandes , Marco Maggini, Automatic term categorization by extracting knowledge from the Web, Proceeding of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy, p.531-535, May 22, 2006
|
|
|
T. Agnoloni , L. Bacci , E. Francesconi , W. Peters , S. Montemagni , G. Venturi, A two-level knowledge approach to support multilingual legislative drafting, Proceeding of the 2009 conference on Law, Ontologies and the Semantic Web: Channelling the Legal Information Flood, p.177-198, July 08, 2009
|
|
|
|
|
|
Anjo Anjewierden , Hannie Gijlers , Bas Kolloffel , Nadira Saab , Robert De Hoog, Examining the relation between domain-related communication and collaborative inquiry learning, Proceedings of the 9th international conference on Computer supported collaborative learning, p.123-131, June 08-13, 2009, Rhodes, Greece
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Yutaka Matsuo , Hironori Tomobe , Takuichi Nishimura, Robust estimation of Google counts for social network extraction, Proceedings of the 22nd national conference on Artificial intelligence, p.1395-1401, July 22-26, 2007, Vancouver, British Columbia, Canada
|
|
|
Sebastian Blohm , Philipp Cimiano , Egon Stemle, Harvesting relations from the web: quantifiying the impact of filtering functions, Proceedings of the 22nd national conference on Artificial intelligence, p.1316-1321, July 22-26, 2007, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|
|
|
|
|
Albert Weichselbraun , Gerhard Wohlgenannt , Arno Scharl , Michael Granitzer , Thomas Neidhart , Andreas Juffinger, Discovery and evaluation of non-taxonomic relations in domain ontologies, International Journal of Metadata, Semantics and Ontologies, v.4 n.3, p.212-222, August 2009
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Cheng-Yu Lu , Shian-Hua Lin , Jen-Chang Liu , Samuel Cruz-Lara , Jen-Shin Hong, Automatic event-level textual emotion sensing using mutual action histogram between entities, Expert Systems with Applications: An International Journal, v.37 n.2, p.1643-1653, March, 2010
|
|
|
|
|
|
|
|
|
Eiji Aramaki , Takeshi Imai , Kengo Miyo , Kazuhiko Ohe, UTH: SVM-based semantic relation classification using physical sizes, Proceedings of the 4th International Workshop on Semantic Evaluations, p.464-467, June 23-24, 2007, Prague, Czech Republic
|
|
|
|
|
|
|
|
|
Andrew Carlson , Justin Betteridge , Estevam R. Hruschka, Jr. , Tom M. Mitchell, Coupling semi-supervised learning of categories and relations, Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing, p.1-9, June 04-04, 2009, Boulder, Colorado
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Roxana Girju , Preslav Nakov , Vivi Nastase , Stan Szpakowicz , Peter Turney , Deniz Yuret, SemEval-2007 task 04: classification of semantic relations between nominals, Proceedings of the 4th International Workshop on Semantic Evaluations, p.13-18, June 23-24, 2007, Prague, Czech Republic
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Partha Pratim Talukdar , Joseph Reisinger , Marius Paşca , Deepak Ravichandran , Rahul Bhagat , Fernando Pereira, Weakly-supervised acquisition of labeled class instances using graph random walks, Proceedings of the Conference on Empirical Methods in Natural Language Processing, October 25-27, 2008, Honolulu, Hawaii
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Iris Hendrickx , Su Nam Kim , Zornitsa Kozareva , Preslav Nakov , Diarmuid Ó Séaghdha , Sebastian Padó , Marco Pennacchiotti , Lorenza Romano , Stan Szpakowicz, SemEval-2010 task 8: multi-way classification of semantic relations between pairs of nominals, Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions, June 04-04, 2009, Boulder, Colorado
|
|
|
|
|
|
Oren Etzioni , Michael Cafarella , Doug Downey , Ana-Maria Popescu , Tal Shaked , Stephen Soderland , Daniel S. Weld , Alexander Yates, Unsupervised named-entity extraction from the Web: An experimental study, Artificial Intelligence, v.165 n.1, p.91-134, June 2005
|
|
|
|
|
|
|
|
|
K. Chandramouli , T. Kliegr , V. Svatek , E. Izquierdo, Towards semantic tagging in collaborative environments, Proceedings of the 16th international conference on Digital Signal Processing, p.248-253, July 05-07, 2009, Santorini, Greece
|
|