ACM Home Page
Please provide us with feedback. Feedback
SOFIE: a self-organizing framework for information extraction
Full text PdfPdf (777 KB)
Source
International World Wide Web Conference archive
Proceedings of the 18th international conference on World wide web table of contents
Madrid, Spain
SESSION: Semantic/data web/session: linked data table of contents
Pages 631-640  
Year of Publication: 2009
ISBN:978-1-60558-487-4
Authors
Fabian M. Suchanek  Max-Planck Institute for Informatics, Saarbruecken, Germany
Mauro Sozio  Max-Planck Institute for Informatics, Saarbruecken, Germany
Gerhard Weikum  Max-Planck Institute for Informatics, Saarbruecken, Germany
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 29,   Downloads (12 Months): 175,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1526709.1526794
What is a DOI?

ABSTRACT

This paper presents SOFIE, a system for automated ontology extension. SOFIE can parse natural language documents, extract ontological facts from them and link the facts into an ontology. SOFIE uses logical reasoning on the existing knowledge and on the new knowledge in order to disambiguate words to their most probable meaning, to reason on the meaning of text patterns and to take into account world knowledge axioms. This allows SOFIE to check the plausibility of hypotheses and to avoid inconsistencies with the ontology. The framework of SOFIE unites the paradigms of pattern matching, word sense disambiguation and ontological reasoning in one unified model. Our experiments show that SOFIE delivers high-quality output, even from unstructured Internet documents.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Approximating the value of two power proof systems, with applications to max 2sat and max dicut. In ISTCS 1995.
2
 
3
 
4
S. Auer, C. Bizer, G. Kobilarov, J. Lehmann, R. Cyganiak, Z. G. Ives. Dbpedia: A nucleus for a Web of open data.% In ISWC, volume 4825 of LNCS, pages 722--735. Springer, 2007. In ISWC 2007.
 
5
M. Banko, M. J. Cafarella, S. Soderland, M. Broadhead, O. Etzioni. Open information extraction from the Web. In IJCAI 2007.
6
 
7
M. Battiti, R. Protasi. Approximate algorithms and solutions for Max SAT. In G. Xue, editor, Handbook of Combinatorial Optimization DingZhu Du and Panos M. Pardalos (co-editors), Kluwer Academic Publishers, 1998, Vols. 1--3, ISBN: 0-7923-5285-8, volume 19, pages 425--430, 2001. , Kluwer, 2001.
 
8
 
9
 
10
S. Blohm, P. Cimiano, E. Stemle. Harvesting relations from the Web-quantifiying the impact of filtering functions. In Proceedings of the 22nd Conference on Artificial Intelligence (AAAI-07), pages 1316--1323. Association for the Advancement of Artificial Intelligence (AAAI), JUL 2007. In AAAI 2007.
 
11
 
12
13
 
14
V. de Boer, M. van Someren, B. J. Wielinga. Extracting instances of relations from Web documents using redundancy. In ESWC 2006.
 
15
 
16
 
17
O. Etzioni, M. Banko, M. J. Cafarella. Machine reading. In AAAI 2006.
18
 
19
C. Fellbaum, editor. WordNet: An Electronic Lexical Database. MIT Press, 1998.
 
20
 
21
 
22
 
23
 
24
D. S. Johnson. Approximation algorithms for combinatorial problems. J. Comput. Syst. Sci., 9(3):256--278, 1974.
 
25
 
26
 
27
28
 
29
S. P. Ponzetto and M. Strube. Deriving a large-scale taxonomy from Wikipedia. In AAAI, 2007.
 
30
H. Poon and P. Domingos. Joint inference in information extraction. In AAAI, 2007.
 
31
 
32
 
33
F. Reiss, S. Raghavan, R. Krishnamurthy, H. Zhu, S. Vaithyanathan. An algebraic approach to rule-based information extraction. In ICDE 2008.
 
34
 
35
 
36
 
37
S. Staab and R. Studer, editors. Handbook on Ontologies, 2nd edition. Springer, 2008.
 
38
F. M. Suchanek. Automated Construction and Growth of a Large Ontology. PhD thesis, Saarland University, Germany, 2008.
39
40
 
41
 
42
43
 
44
G. Wang, Y. Yu, H. Zhu. Pore: Positive-only relation extraction from Wikipedia text. In ISWC, 2007.
 
45
N. Weber and P. Buitelaar. Web-based ontology learning with Isolde. In ISWC Workshop on WebContent Mining with Human Language Technologies 2006.
46
47
 
48
 
49

Collaborative Colleagues:
Fabian M. Suchanek: colleagues
Mauro Sozio: colleagues
Gerhard Weikum: colleagues