ABSTRACT
Ontologies play a prominent role on the Semantic Web. They make possible the widespread publication of machine-understandable data, opening myriad opportunities for automated information processing. However, because of the Semantic Web's distributed nature, data on it will inevitably come from many different ontologies. Information processing across ontologies is not possible without knowing the semantic mappings between their elements. Manually finding such mappings is tedious, error-prone, and clearly not possible at Web scale. Hence, the development of tools to assist in the ontology mapping process is crucial to the success of the Semantic Web.

We describe GLUE, a system that employs machine learning techniques to find such mappings. Given two ontologies, for each concept in one ontology GLUE finds the most similar concept in the other ontology. We give well-founded probabilistic definitions to several practical similarity measures, and show that GLUE can work with all of them. This is in contrast to most existing approaches, which deal with a single similarity measure. Another key feature of GLUE is that it uses multiple learning strategies, each of which exploits a different type of information either in the data instances or in the taxonomic structure of the ontologies. To further improve matching accuracy, we extend GLUE to incorporate commonsense knowledge and domain constraints into the matching process. For this purpose, we show that relaxation labeling, a well-known constraint optimization technique used in computer vision and other fields, can be adapted to work efficiently in our context. Our approach is thus distinguished in that it works with a variety of well-defined similarity notions and that it efficiently incorporates multiple types of knowledge. We describe a set of experiments on several real-world domains, and show that GLUE proposes highly accurate semantic mappings.
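To make the idea of a probabilistic similarity measure concrete, the following is a minimal Python sketch, not the GLUE implementation. The abstract does not name the measures it supports, so the Jaccard coefficient is used here only as one common example of a measure that can be written in terms of the joint distribution of two concepts A and B. The names `JointDistribution`, `estimate_joint`, and `jaccard_similarity` are hypothetical, and the sketch assumes that concept membership of each instance is already known (in practice it would have to be predicted, for example by learned classifiers).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class JointDistribution:
    """Joint probabilities that a random instance belongs to concepts A and B."""
    p_a_b: float          # P(A, B): instance is in both concepts
    p_a_not_b: float      # P(A, not B): instance is in A only
    p_not_a_b: float      # P(not A, B): instance is in B only
    p_not_a_not_b: float  # P(not A, not B): instance is in neither


def estimate_joint(in_a, in_b):
    """Estimate the joint distribution from parallel boolean membership lists.

    in_a[i] and in_b[i] say whether instance i belongs to A and to B.
    Assumption for this sketch: memberships are given, not learned.
    """
    if len(in_a) != len(in_b) or not in_a:
        raise ValueError("need two non-empty lists of equal length")
    n = len(in_a)
    both = sum(1 for a, b in zip(in_a, in_b) if a and b)
    only_a = sum(1 for a, b in zip(in_a, in_b) if a and not b)
    only_b = sum(1 for a, b in zip(in_a, in_b) if b and not a)
    neither = n - both - only_a - only_b
    return JointDistribution(both / n, only_a / n, only_b / n, neither / n)


def jaccard_similarity(d):
    """Jaccard coefficient P(A and B) / P(A or B).

    Returns 0.0 when neither concept has any probability mass.
    """
    union = d.p_a_b + d.p_a_not_b + d.p_not_a_b
    return d.p_a_b / union if union > 0 else 0.0


if __name__ == "__main__":
    # Five hypothetical instances and their membership in A and B.
    in_a = [True, True, True, False, False]
    in_b = [True, True, False, True, False]
    print(jaccard_similarity(estimate_joint(in_a, in_b)))
```

In the usage example, two of the four instances that belong to at least one concept belong to both, so the printed value is about 0.5. Because the measure is a function of the joint distribution alone, other similarity measures defined on the same four probabilities could be swapped in without changing `estimate_joint`.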