| Learning the distance metric in a personal ontology |
| Full text |
Pdf
(440 KB)
|
Source
|
Conference on Information and Knowledge Management
archive
Proceeding of the 2nd international workshop on Ontologies and nformation systems for the semantic web
table of contents
Napa Valley, California, USA
SESSION: Session 1
table of contents
Pages 17-24
Year of Publication: 2008
ISBN:978-1-60558-255-9
|
|
Authors
|
|
Hui Yang
|
Carnegie Mellon University, Pittsburgh, PA, USA
|
|
Jamie Callan
|
Carnegie Mellon University, Pittsburgh, PA, USA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 8, Downloads (12 Months): 75, Citation Count: 0
|
|
|
ABSTRACT
Personal ontology construction is the task of sorting through relevant materials, identifying the main topics and concepts, and organizing them to suit personal needs. Automatic construction of personal ontologies is difficult in part because measuring the semantic distance between two concepts is difficult. Knowledge-based approaches use either knowledge bases, such as WordNet, or lexico-syntactic patterns to induce the differences between concepts. However, these techniques are only applicable for a subset of concepts and leave the majority unmeasurable. On the other hand, statistical approaches are able to induce the differences between any concept pair but lack of human knowledge involvement and hence suffer from low precision. In the context of personal ontology construction, semantic distances between concepts need to reflect personal preferences. Based on that, this paper presents a supervised hierarchical clustering framework to incorporate personal preferences for distance metric learning in personal ontology construction. In this framework, periodic manual guidance provides training data for learning a distance metric and the learned metric is used during automatic activities to further construct the ontology. A detailed user study demonstrates that the approach is effective and accelerates the construction of personal ontologies.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
A. Bar-Hillel, T. Hertz, N. Shental, and D. Weinshall. Learning distance functions using equivalence relations. In Proceedings of the International Conference on Machine Learning, 2003.
|
| |
2
|
P. Cimiano, A. Hotho, and S. Staab. Comparing conceptual, divisive and agglomerative clustering for learning taxonomies from text. In Proceedings of the European Conference on Artificial Intelligence, 2004.
|
| |
3
|
P. Cimiano and J. Vlker. Text2onto: A framework for ontology learning and data-driven change discovery. In Proceedings of the 10th International Conference on Applications of Natural Language to Information Systems, 2005.
|
| |
4
|
|
| |
5
|
D. Elliman and J. R. G. Pulido. Automatic derivation of on-line document ontologies. In International Workshop on Mechanisms for Enterprise Integration: From Objects to Ontology, 2001.
|
| |
6
|
C. Fellbaum. WordNet:An Electronic Lexical Database. MIT Press, 1998.
|
 |
7
|
|
| |
8
|
|
 |
9
|
|
| |
10
|
|
| |
11
|
|
| |
12
|
L. Khan and L. Wang. Automatic ontology derivation using clustering for image classification. In In Proc. of 8th International Workshop on Multimedia Information Systems, 2002.
|
 |
13
|
|
| |
14
|
|
 |
15
|
|
| |
16
|
|
| |
17
|
|
| |
18
|
R. Tibshirani, G. Walther, and T. Hastie. Estimating the number of clusters in a dataset via the gap statistic. In Tech. Rep. 208, Dept. of Statistics, Stanford University, 2000.
|
| |
19
|
E. P. Xing, A. Y. Ng, M. I. Jordan, and S. Russell. Distance metric learning, with application to clustering with side-information. In Advances in Neural Information Processing Systems, 2002.
|
 |
20
|
|
|