ACM Home Page
Please provide us with feedback. Feedback
Efficient core computation in data exchange
Full text PdfPdf (474 KB)
Source
Journal of the ACM (JACM) archive
Volume 55 ,  Issue 2  (May 2008) table of contents
Article No. 9  
Year of Publication: 2008
ISSN:0004-5411
Authors
Georg Gottlob  University of Oxford, Oxford, United Kingdom
Alan Nash  IBM Almaden Research Center, San Jose, California
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 22,   Downloads (12 Months): 212,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1346330.1346334
What is a DOI?

ABSTRACT

Data exchange deals with inserting data from one database into another database having a different schema. Fagin et al. [2005] have shown that among the universal solutions of a solvable data exchange problem, there exists—up to isomorphism—a unique most compact one, “the core”, and have convincingly argued that this core should be the database to be materialized. They stated as an important open problem whether the core can be computed in polynomial time in the general setting where the mapping between the source and target schemas is given by source-to-target constraints that are arbitrary tuple generating dependencies (tgds) and target constraints consisting of equality generating dependencies (egds) and a weakly acyclic set of tgds. In this article, we solve this problem by developing new methods for efficiently computing the core of a universal solution. This positive result shows that data exchange based on cores is feasible and applicable in a very general setting. In addition to our main result, we use the method of hypertree decompositions to derive new algorithms and upper bounds for query containment checking and computing cores of arbitrary database instances. We also show that computing the core of a data exchange problem is fixed-parameter intractable with respect to a number of relevant parameters, and that computing cores is NP-complete if the rule bodies of target tgds are augmented by a special predicate that distinguishes a null value from a constant data value.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Adler, I. 2004. Marshals, monotone marshals, and hypertree-width. J. Graph Theory 47, 4, 275--296.
 
3
4
5
 
6
Aho, A., Sagiv, J., and Ullman, J. 1979c. Equivalence of relational expressions. SIAM J. Comput. 8, 2, 218--246.
7
 
8
9
 
10
 
11
 
12
Downey, R., Fellows, M., and Tailor, U. 1996. The parameterized complexity of relational database queries and an improved characterization of W[1]. In Combinatorics, Complexity, and Logic—Proceedings DMTCS'96, D. S. Bridges, C. S. Calude, J. Gibbons, S. Reeves, and I. H. Witten, Eds.
 
13
Downey, R. G., and Fellows, M. R. 1999. Parameterized Complexity. Springer-Verlag, New York.
14
 
15
Fagin, R. 2005. Extending the core greedy algorithm to allow target TGDS with singleton left-hand sides. Unpublished manuscript.
 
16
17
18
19
 
20
21
 
22
23
 
24
Gottlob, G., Leone, N., and Scarcello, F. 2002. Hypertree decompositions and tractable queries. J. Comput. Syst. Sci. 64, 3, 579--627.
 
25
26
 
27
28
29
 
30
 
31
32
33
 
34
35
 
36
 
37
 
38
Pichler, R., and Savenkov, V. 2008. Implementing core computation for data exchange. Tech. Rep. DBAI-TR-2008-01, Institut für Informationssysteme, Technische Universität Wien, A-1040 Vienna, Austria.
 
39
 
40
Robertson, N., and Seymour, P. 1986. Graph minors II: Algorithmic aspects of tree-width. J. Algor. 7, 309--322.
41


Collaborative Colleagues:
Georg Gottlob: colleagues
Alan Nash: colleagues