| Mapping enterprise entities to text segments |
| Full text |
Pdf
(315 KB)
|
Source
|
Conference on Information and Knowledge Management
archive
Proceeding of the 2nd PhD workshop on Information and knowledge management
table of contents
Napa Valley, California, USA
POSTER SESSION: Poster session
table of contents
Pages 85-88
Year of Publication: 2008
ISBN:978-1-60558-257-3
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 75, Citation Count: 0
|
|
|
ABSTRACT
Today, valuable business information is increasingly stored as unstructured data (documents, emails, etc.). For example, documents exchanged between business partners capture information on transactions between them like purchases or invoices. A major challenge is to correctly recognize and associate real-world entities in unstructured data, e.g. documents, with those stored in structured data e.g., enterprise databases. To address this, we propose in this paper a robust process methodology consisting of three phases: entity extraction from documents, generation of mapping of recognized entities with structured data, and disambiguation of mappings exploiting relationships from the enterprise data and the documents' structure.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
 |
3
|
Venkatesan T. Chakaravarthy , Vinayaka Pandit , Sambuddha Roy , Pranjal Awasthi , Mukesh Mohania, Decision trees for entity identification: approximation algorithms and hardness results, Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, June 11-13, 2007, Beijing, China
[doi> 10.1145/1265530.1265538]
|
| |
4
|
Hassell, J., Aleman-Meza, B., and Arpinar, I. B. Ontology-driven automatic entity disambiguation in unstructured text. In Proc. ISWC (2006), pp. 44--57.
|
 |
5
|
|
| |
6
|
|
| |
7
|
Michelson, M., and Knoblock, C. A. Beginning to understand unstructured, ungrammatical text: An information integration approach. In Proc. of the AAAI Spring Symposium on Machine Reading (2007).
|
| |
8
|
Nadeau, David, Sekine, and Satoshi. A survey of named entity recognition and classification. Linguisticae Investigationes 30, 1 (2007), 3--26.
|
|