| Semi-automatic ontology extraction to create draft topic maps |
| Full text |
Pdf
(221 KB)
|
| Source
|
ACM Southeast Regional Conference
archive
Proceedings of the 45th annual southeast regional conference
table of contents
Winston-Salem, North Carolina
Pages: 100 - 105
Year of Publication: 2007
ISBN:978-1-59593-629-5
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 14, Downloads (12 Months): 99, Citation Count: 0
|
|
|
ABSTRACT
Topic maps are a Semantic Web technology that provides a human-oriented mechanism to encode knowledge by organizing web information around topics. Studies have shown, however, that authors face major difficulties in constructing topic maps. This paper discusses an approach to automatic construction of a "draft" topic map for the authors to start with. The idea is to extract topic map constructs by crawling a website and parsing its pages. We propose a set of heuristics that can be used for extracting semantic information from the HTML markup of the web pages. We have used this approach to design and implement a plug-in for the topic map editor TM4L that automatically extracts topics and relationships from a website specified by the author. An evaluation of the proposed approach in terms of Recall and Precision of the extracted data is presented.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Topic Maps-Based Digital Libraries, http://compsci.wssu.edu/iis/nsdl/ {last seen 12/01/2006}
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
| |
5
|
Vargas-Vera, M. and Motta, E. and Domingue, J. and Shum, S. B. and Lanzoni, M. (2000). Knowledge Extraction by using an Ontology-based Annotation Tool. White paper by Association for Computing Machinery. http://kmi.open.ac.uk/projects/akt/publication-pdf/vargassaw. pdf {last seen 11/05/2006}
|
 |
6
|
|
| |
7
|
Buitelaar, P. and Olejnik, D. and Sintek, M. (2004). A Protégé Plug-In for Ontology Extraction from Text Based on Linguistic Analysis. In Proc. of the 1st European Semantic Web Symposium. http://dfki.de/~paulb/esws04.pdf {last seen 11/04/2006}
|
| |
8
|
The Protégé Ontology Editor and Knowledge Acquisition System, http://smi.stanford.edu/projects/protege/index.html. {last seen 12/01/2006}
|
| |
9
|
Fortuna, B. and Mladinic, D. and Grobelnik, M. (2006). System for Semi-Automatic Ontology Construction. In Proc. of the 3rd European Semantic Web Symposium. http://www.eswc2006.org/demo-papers/FD18-Fortuna.pdf {last seen 11/04/2006}
|
| |
10
|
|
| |
11
|
TM4J - Topic Maps for Java, http://tm4j.org/ {last seen 12/01/2006}
|
| |
12
|
Open Source Crawlers in Java - WebSPHINX, http://www.java-source.net/open-source/crawlers/websphinx. {last seen 12/01/2006}
|
| |
13
|
|
| |
14
|
Roberson, S. E. (2006). Semi-Automatic Ontology Extraction to Create Template Topic Maps. Master Thesis, Winston-Salem State University, Winston-Salem, NC.
|
|