ACM Home Page
Please provide us with feedback. Feedback
Composition of mappings given by embedded dependencies
Full text PdfPdf (341 KB)
Source
ACM Transactions on Database Systems (TODS) archive
Volume 32 ,  Issue 1  (March 2007) table of contents
Article No. 4  
Year of Publication: 2007
ISSN:0362-5915
Authors
Alan Nash  University of California, San Diego
Philip A. Bernstein  Microsoft Research, Redmond, WA
Sergey Melnik  Microsoft Research, Redmond, WA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 128,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1206049.1206053
What is a DOI?

ABSTRACT

Composition of mappings between schemas is essential to support schema evolution, data exchange, data integration, and other data management tasks. In many applications, mappings are given by embedded dependencies. In this article, we study the issues involved in composing such mappings.

Our algorithms and results extend those of Fagin et al. [2004], who studied the composition of mappings given by several kinds of constraints. In particular, they proved that full source-to-target tuple-generating dependencies (tgds) are closed under composition, but embedded source-to-target tgds are not. They introduced a class of second-order constraints, SO tgds, that is closed under composition and has desirable properties for data exchange.

We study constraints that need not be source-to-target and we concentrate on obtaining (first-order) embedded dependencies. As part of this study, we also consider full dependencies and second-order constraints that arise from Skolemizing embedded dependencies. For each of the three classes of mappings that we study, we provide: (a) an algorithm that attempts to compute the composition; and (b) sufficient conditions on the input mappings which guarantee that the algorithm will succeed.

In addition, we give several negative results. In particular, we show that full and second-order dependencies that are not limited to be source-to-target are not closed under composition (for the latter, under the additional restriction that no new function symbols are introduced). Furthermore, we show that determining whether the composition can be given by these kinds of dependencies is undecidable.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
Bernstein, P. A. 2003. Applying model management to classical meta-data problems. In Proceedings of the Conference on Innovative Data Systems Research (CIDR). 209--220.
4
 
5
 
6
Ebbinhaus, H.-D. and Flum, J. 1999. Finite Model Theory. Springer Verlag.
 
7
Fagin, R. 1975. Monadic generalized spectra. Zeitschr. Math. Logik Grundlagen Math. 21, 89--96.
8
9
10
 
11
12
13
14
 
15
Halevy, A. Y., Ives, Z. G., Suciu, D., and Tatarinov, I. 2003. Schema mediation in peer data management systems. In Proceedings of the International Conference on Data Engineering (ICDE).
16
 
17
18
 
19
Madhavan, J. and Halevy, A. Y. 2003. Composing mappings among data sources. In Proceedings of the International Conference on Very Large Data Bases (VLDB). 572--583.
 
20
Melnik, S. 2004. Generic model management: Concepts and algorithms. Ph.D. Thesis, University of Leipzig. Lecture Notes in Computer Science, vol. 2967. Springer Verlag.
21
22
 
23
Nicolas, J. M. 1987. First order logic formalization for function, multivalued, and mutual dependencies. In Proceedings of the ACM SIGMOD International Conference on Management of Data. 40--46.
24
 
25
26
27


Collaborative Colleagues:
Alan Nash: colleagues
Philip A. Bernstein: colleagues
Sergey Melnik: colleagues