ABSTRACT
We study the problem of answering queries through a target schema, given a set of mappings between one or more source schemas and this target schema, and given that the data is at the sources. The schemas can be any combination of relational or XML schemas, and can be independently designed. In addition to the source-to-target mappings, we consider as part of the mapping scenario a set of target constraints specifying additional properties on the target schema. This becomes particularly important when integrating data from multiple data sources with overlapping data and when such constraints can express data merging rules at the target. We define the semantics of query answering in such an integration scenario, and design two novel algorithms, basic query rewrite and query resolution, to implement the semantics. The basic query rewrite algorithm reformulates target queries in terms of the source schemas, based on the mappings. The query resolution algorithm generates additional rewritings that merge related information from multiple sources and assemble a coherent view of the data, by incorporating target constraints. The algorithms are implemented and then evaluated using a comprehensive set of experiments based on both synthetic and real-life data integration scenarios.
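To make the two steps concrete, here is a minimal sketch in Python. It is a toy illustration, not the algorithms of the paper: the source names (`hr`, `crm`), the attribute names, the flat attribute-renaming mappings, and the key constraint on `name` are all assumptions made up for this example. It also merges answers after they are retrieved, whereas the query resolution algorithm described above generates additional rewritings.

```python
from typing import Dict, List, Optional

Row = Dict[str, Optional[str]]

# Hypothetical source data; the target holds no data of its own.
SOURCES: Dict[str, List[Dict[str, str]]] = {
    "hr": [{"full_name": "Ann", "mail": "ann@example.org"}],
    "crm": [
        {"contact": "Ann", "tel": "555-0100"},
        {"contact": "Bob", "tel": "555-0101"},
    ],
}

# Hypothetical source-to-target mappings: target attribute -> source attribute.
MAPPINGS: Dict[str, Dict[str, str]] = {
    "hr": {"name": "full_name", "email": "mail"},
    "crm": {"name": "contact", "phone": "tel"},
}


def basic_rewrite(wanted: List[str]) -> List[Row]:
    """Answer a projection over the target by reformulating it per source.

    Each mapping translates the requested target attributes into source
    attributes; attributes a source does not cover come back as None.
    """
    answers: List[Row] = []
    for source, attr_map in MAPPINGS.items():
        for row in SOURCES[source]:
            answers.append(
                {a: row.get(attr_map[a]) if a in attr_map else None for a in wanted}
            )
    return answers


def resolve(rows: List[Row], key: str) -> List[Row]:
    """Merge partial answers that agree on a target key (assumed constraint).

    Non-null values fill in missing ones; on a conflict the later row wins,
    which is a simplification chosen for this sketch.
    """
    merged: Dict[Optional[str], Row] = {}
    for row in rows:
        slot = merged.setdefault(row[key], dict.fromkeys(row))
        for attr, value in row.items():
            if value is not None:
                slot[attr] = value
    return list(merged.values())


partial = basic_rewrite(["name", "email", "phone"])
coherent = resolve(partial, key="name")
```

In this sketch `partial` holds one incomplete tuple per source row, and `coherent` combines the two tuples for the same `name` into a single tuple carrying both the email and the phone number.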