| Knowledge-based extraction of named entities |
| Full text |
Pdf
(125 KB)
|
| Source
|
Conference on Information and Knowledge Management
archive
Proceedings of the eleventh international conference on Information and knowledge management
table of contents
McLean, Virginia, USA
SESSION: Information retrieval
table of contents
Pages: 532 - 537
Year of Publication: 2002
ISBN:1-58113-492-4
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 12, Downloads (12 Months): 68, Citation Count: 4
|
|
|
ABSTRACT
The usual approach to named-entity detection is to learn extraction rules that rely on linguistic, syntactic, or document format patterns that are consistent across a set of documents. However, when there is no consistency among documents, it may be more effective to learn document-specific extraction rules.This paper presents a knowledge-based approach to learning rules for named-entity extraction. Document-specific extraction rules are created using a generate-and-test paradigm and a database of known named-entities. Experimental results show that this approach is effective on Web documents that are difficult for the usual methods.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Daniel M. Bikel , Scott Miller , Richard Schwartz , Ralph Weischedel, Nymble: a high-performance learning name-finder, Proceedings of the fifth conference on Applied natural language processing, p.194-201, March 31-April 03, 1997, Washington, DC
[doi> 10.3115/974557.974586]
|
| |
2
|
|
| |
3
|
N. Chincor. MUC-7 named entity task definition dry run version, version 3.5. In Proceedings of the Seventh Message Understanding Conference (MUC-7). Morgan Kaufmann, 1998.
|
| |
4
|
W. Cohen, A. McCallum, and D. Quass. Learning to understand the Web. IEEE Data Engineering Bulletin, 23(3):17--24, 2000.
|
| |
5
|
|
| |
6
|
G. Hoff. HomePageSearch. http://hpsearch.uni-trier.de/. University of Trier, 2002.
|
| |
7
|
T. Kitani and T. Mitamura. An accurate morphological analysis and proper name identification for Japanese text processing. Journal of Information Processing Society of Japan, 35(3):404--413, 1994.
|
| |
8
|
|
| |
9
|
L. F. Rau. Extracting company names from text. In Proceedings of the Sixth IEEE Conference on Artificial Intelligence Applications, 1991.
|
| |
10
|
K. Seymore, A. McCallum, and R. Rosenfeld. Learning hidden markov model structure for information extraction. In AAAI'99 Workshop on Machine Learning for Information Extraction, 1999.
|
| |
11
|
|
CITED BY 4
|
|
|
|
|
|
|
|
|
|
|
Wei Zhang , Shuang Liu , Clement Yu , Chaojing Sun , Fang Liu , Weiyi Meng, Recognition and classification of noun phrases in queries for effective retrieval, Proceedings of the sixteenth ACM conference on Conference on information and knowledge management, November 06-10, 2007, Lisbon, Portugal
|
|