ACM Home Page
Please provide us with feedback. Feedback
Knowledge-based extraction of named entities
Full text PdfPdf (125 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the eleventh international conference on Information and knowledge management table of contents
McLean, Virginia, USA
SESSION: Information retrieval table of contents
Pages: 532 - 537  
Year of Publication: 2002
ISBN:1-58113-492-4
Authors
Jamie Callan  Carnegie Mellon University, Pittsburgh, PA
Teruko Mitamura  Carnegie Mellon University, Pittsburgh, PA
Sponsors
SIGMIS: ACM Special Interest Group on Management Information Systems
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 68,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/584792.584880
What is a DOI?

ABSTRACT

The usual approach to named-entity detection is to learn extraction rules that rely on linguistic, syntactic, or document format patterns that are consistent across a set of documents. However, when there is no consistency among documents, it may be more effective to learn document-specific extraction rules.This paper presents a knowledge-based approach to learning rules for named-entity extraction. Document-specific extraction rules are created using a generate-and-test paradigm and a database of known named-entities. Experimental results show that this approach is effective on Web documents that are difficult for the usual methods.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
N. Chincor. MUC-7 named entity task definition dry run version, version 3.5. In Proceedings of the Seventh Message Understanding Conference (MUC-7). Morgan Kaufmann, 1998.
 
4
W. Cohen, A. McCallum, and D. Quass. Learning to understand the Web. IEEE Data Engineering Bulletin, 23(3):17--24, 2000.
 
5
 
6
G. Hoff. HomePageSearch. http://hpsearch.uni-trier.de/. University of Trier, 2002.
 
7
T. Kitani and T. Mitamura. An accurate morphological analysis and proper name identification for Japanese text processing. Journal of Information Processing Society of Japan, 35(3):404--413, 1994.
 
8
 
9
L. F. Rau. Extracting company names from text. In Proceedings of the Sixth IEEE Conference on Artificial Intelligence Applications, 1991.
 
10
K. Seymore, A. McCallum, and R. Rosenfeld. Learning hidden markov model structure for information extraction. In AAAI'99 Workshop on Machine Learning for Information Extraction, 1999.
 
11


Collaborative Colleagues:
Jamie Callan: colleagues
Teruko Mitamura: colleagues