ACM Home Page
Please provide us with feedback. Feedback
An ontology-based cluster analysis framework
Full text PdfPdf (119 KB)
Source ACM International Conference Proceeding Series; Vol. 308 archive
Proceedings of the first international workshop on Ontology-supported business intelligence table of contents
Karlsruhe, Germany
Article No. 7  
Year of Publication: 2008
ISBN:978-1-60558-219-1
Authors
Paweł Lula  Cracow University of Economics, Kraków, Poland
Grażyna Paliwoda-Pękosz  Cracow University of Economics, Kraków, Poland
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 138,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1452567.1452574
What is a DOI?

ABSTRACT

The main objectives of this paper is to propose a conceptual and software environment in which different aspects of cluster analysis of ontology-based data could be studied. The ontology-based dataset has two core components: description of categories and description of objects and relationships between them. Similarity between objects is defined as an amalgamation function of taxonomic, relationship and attribute similarity. The different measures to calculate similarity can be used. Further research is needed in order to evaluate these measures. The creation of a software tool which allows for classification of ontology-based data and comprehensive analysis of results is essential for the research in the area of ontology-based data mining. Such a tool should be universal, extensible and open. The universality manifests itself in the possibility of processing any data sets described by OWL tailored to meet individual requirements. The system extensibility means that it can be enriched with new elements without the necessity of making changes in its main elements. The openness enables the communications with other data analysis systems. In the paper theoretical aspects of cluster analysis of ontology-based data sets are presented. Next, a framework of cluster analysis system is outlined. Finally, some technical details of the system implementation are discussed.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Budanitsky, A., and Hirst, G. 2001. Semantic Distance in WordNet: An Experimental, Application-oriented Evaluation of Five Measures. Workshop on WordNet and Other Lexical Resources. In the North American Chapter of the Association for Computational Linguistics (NAACL-2000), Pittsburgh, PA.
 
2
 
3
Gruber, T. 1995. Toward Principles for the Design of Ontologies Used for Knowledge Sharing. In Formal Ontology in Conceptual Analysis and Knowledge Representation, N. Guarino and R. Poli (eds), Kluwer Academic Publishers.
 
4
 
5
Jaro, M. A. 1995. Probabilistic linkage of large public health data file. Statistics in Medicine 14, 491--498.
 
6
 
7
 
8
 
9
Schickel-Zuber, V., Faltings, B. 2007. OSS: A Semantic Similarity Function based on Hierarchical Ontologies. In IJCAI, 551--556.
 
10
Winkler, W. E. 1999. The state of record linkage and current research problems. Statistics of Income Division. Internal Revenue Service Publication R99/04.
 
11
Zhang, X., Jing, L., Hu X., Ng M. and Zhou, X. 2007. A Comparative Study of Ontology Based Term Similarity Measures on PubMed Document Clustering. http://www.pages.drexel.edu/~xz38/pdf/209_Zhang_DASFAA07.pdf

Collaborative Colleagues:
Paweł Lula: colleagues
Grażyna Paliwoda-Pękosz: colleagues