ACM Home Page
Please provide us with feedback. Feedback
Semi-automatic ontology extraction to create draft topic maps
Full text PdfPdf (221 KB)
Source ACM Southeast Regional Conference archive
Proceedings of the 45th annual southeast regional conference table of contents
Winston-Salem, North Carolina
SESSION: Papers table of contents
Pages: 100 - 105  
Year of Publication: 2007
ISBN:978-1-59593-629-5
Authors
Steven Roberson  Winston-Salem State University, Winston-Salem, NC
Darina Dicheva  Winston-Salem State University, Winston-Salem, NC
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 21,   Downloads (12 Months): 101,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1233341.1233360
What is a DOI?

ABSTRACT

Topic maps are a Semantic Web technology that provides a human-oriented mechanism to encode knowledge by organizing web information around topics. Studies have shown, however, that authors face major difficulties in constructing topic maps. This paper discusses an approach to automatic construction of a "draft" topic map for the authors to start with. The idea is to extract topic map constructs by crawling a website and parsing its pages. We propose a set of heuristics that can be used for extracting semantic information from the HTML markup of the web pages. We have used this approach to design and implement a plug-in for the topic map editor TM4L that automatically extracts topics and relationships from a website specified by the author. An evaluation of the proposed approach in terms of Recall and Precision of the extracted data is presented.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Topic Maps-Based Digital Libraries, http://compsci.wssu.edu/iis/nsdl/ {last seen 12/01/2006}
 
2
 
3
 
4
 
5
Vargas-Vera, M. and Motta, E. and Domingue, J. and Shum, S. B. and Lanzoni, M. (2000). Knowledge Extraction by using an Ontology-based Annotation Tool. White paper by Association for Computing Machinery. http://kmi.open.ac.uk/projects/akt/publication-pdf/vargassaw. pdf {last seen 11/05/2006}
6
 
7
Buitelaar, P. and Olejnik, D. and Sintek, M. (2004). A Protégé Plug-In for Ontology Extraction from Text Based on Linguistic Analysis. In Proc. of the 1st European Semantic Web Symposium. http://dfki.de/~paulb/esws04.pdf {last seen 11/04/2006}
 
8
The Protégé Ontology Editor and Knowledge Acquisition System, http://smi.stanford.edu/projects/protege/index.html. {last seen 12/01/2006}
 
9
Fortuna, B. and Mladinic, D. and Grobelnik, M. (2006). System for Semi-Automatic Ontology Construction. In Proc. of the 3rd European Semantic Web Symposium. http://www.eswc2006.org/demo-papers/FD18-Fortuna.pdf {last seen 11/04/2006}
 
10
 
11
TM4J - Topic Maps for Java, http://tm4j.org/ {last seen 12/01/2006}
 
12
Open Source Crawlers in Java - WebSPHINX, http://www.java-source.net/open-source/crawlers/websphinx. {last seen 12/01/2006}
 
13
 
14
Roberson, S. E. (2006). Semi-Automatic Ontology Extraction to Create Template Topic Maps. Master Thesis, Winston-Salem State University, Winston-Salem, NC.

Collaborative Colleagues:
Steven Roberson: colleagues
Darina Dicheva: colleagues