ACM Home Page
Please provide us with feedback. Feedback
Learning the distance metric in a personal ontology
Full text PdfPdf (440 KB)
Source
Conference on Information and Knowledge Management archive
Proceeding of the 2nd international workshop on Ontologies and nformation systems for the semantic web table of contents
Napa Valley, California, USA
SESSION: Session 1 table of contents
Pages 17-24  
Year of Publication: 2008
ISBN:978-1-60558-255-9
Authors
Hui Yang  Carnegie Mellon University, Pittsburgh, PA, USA
Jamie Callan  Carnegie Mellon University, Pittsburgh, PA, USA
Sponsors
ACM: Association for Computing Machinery
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 75,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1458484.1458488
What is a DOI?

ABSTRACT

Personal ontology construction is the task of sorting through relevant materials, identifying the main topics and concepts, and organizing them to suit personal needs. Automatic construction of personal ontologies is difficult in part because measuring the semantic distance between two concepts is difficult. Knowledge-based approaches use either knowledge bases, such as WordNet, or lexico-syntactic patterns to induce the differences between concepts. However, these techniques are only applicable for a subset of concepts and leave the majority unmeasurable. On the other hand, statistical approaches are able to induce the differences between any concept pair but lack of human knowledge involvement and hence suffer from low precision.

In the context of personal ontology construction, semantic distances between concepts need to reflect personal preferences. Based on that, this paper presents a supervised hierarchical clustering framework to incorporate personal preferences for distance metric learning in personal ontology construction. In this framework, periodic manual guidance provides training data for learning a distance metric and the learned metric is used during automatic activities to further construct the ontology. A detailed user study demonstrates that the approach is effective and accelerates the construction of personal ontologies.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
A. Bar-Hillel, T. Hertz, N. Shental, and D. Weinshall. Learning distance functions using equivalence relations. In Proceedings of the International Conference on Machine Learning, 2003.
 
2
P. Cimiano, A. Hotho, and S. Staab. Comparing conceptual, divisive and agglomerative clustering for learning taxonomies from text. In Proceedings of the European Conference on Artificial Intelligence, 2004.
 
3
P. Cimiano and J. Vlker. Text2onto: A framework for ontology learning and data-driven change discovery. In Proceedings of the 10th International Conference on Applications of Natural Language to Information Systems, 2005.
 
4
 
5
D. Elliman and J. R. G. Pulido. Automatic derivation of on-line document ontologies. In International Workshop on Mechanisms for Enterprise Integration: From Objects to Ontology, 2001.
 
6
C. Fellbaum. WordNet:An Electronic Lexical Database. MIT Press, 1998.
7
 
8
9
 
10
 
11
 
12
L. Khan and L. Wang. Automatic ontology derivation using clustering for image classification. In In Proc. of 8th International Workshop on Multimedia Information Systems, 2002.
13
 
14
15
 
16
 
17
 
18
R. Tibshirani, G. Walther, and T. Hastie. Estimating the number of clusters in a dataset via the gap statistic. In Tech. Rep. 208, Dept. of Statistics, Stanford University, 2000.
 
19
E. P. Xing, A. Y. Ng, M. I. Jordan, and S. Russell. Distance metric learning, with application to clustering with side-information. In Advances in Neural Information Processing Systems, 2002.
20