ACM Home Page
Please provide us with feedback. Feedback
On the expressiveness of implicit provenance in query and update languages
Full text PdfPdf (783 KB)
Source
ACM Transactions on Database Systems (TODS) archive
Volume 33 ,  Issue 4  (November 2008) table of contents
Article No. 28  
Year of Publication: 2008
ISSN:0362-5915
Authors
Peter Buneman  University of Edinburgh, Scotland, UK
James Cheney  University of Edinburgh, Scotland, UK
Stijn Vansummeren  Hasselt University and Transnational University of Limburg, Diepenbeek, Belgium
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 20,   Downloads (12 Months): 214,   Citation Count: 2
Additional Information:

appendices and supplements   abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1412331.1412340
What is a DOI?

APPENDICES and SUPPLEMENTS
Online appendix to on the expressiveness of implicit provenance in query and update languages. The appendix supports the information on article 28.


ABSTRACT

Information describing the origin of data, generally referred to as provenance, is important in scientific and curated databases where it is the basis for the trust one puts in their contents. Since such databases are constructed using operations of both query and update languages, it is of paramount importance to describe the effect of these languages on provenance.

In this article we study provenance for query and update languages that are closely related to SQL, and compare two ways in which they can manipulate provenance so that elements of the input are rearranged to elements of the output: implicit provenance, where a query or update only provides the rearranged output, and provenance is provided implicitly by a default provenance semantics; and explicit provenance, where a query or update provides both the output and the description of the provenance of each component of the output. Although explicit provenance is in general more expressive, we show that the classes of implicit provenance operations expressible by query and update languages correspond to natural semantic subclasses of the explicit provenance queries.

One of the consequences of this study is that provenance separates the expressive power of query and update languages. The model is also relevant to annotation propagation schemes in which annotations on the input to a query or update have to be transferred to the output or vice versa.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
 
4
 
5
Bhagwat, D., Chiticariu, L., Tan, W.-C., and Vijayvargiya, G. 2005. An annotation management system for relational databases. VLDB J. 14, 4, 373--396.
6
7
 
8
Buneman, P., Cheney, J., and Vansummeren, S. 2007. On the expressiveness of implicit provenance in query and update languages. In Proceedings of the 11th International Conference on Database Theory (ICDT), T. Schwentick and D. Suciu, Eds. Lecture Notes in Computer Science, vol. 4353. Springer, Berlin, Germany, 209--223.
 
9
 
10
11
 
12
Cheney, J., Ahmed, A., and Acar, U. A. 2007. Provenance as dependency analysis. In Proceedings of the 11th International Symposium on Database Programming Languages (DBPL), Vienna, Austria, Lecture Notes in Computer Science, Vol. 4797, Springer, Berlin, Germany, 138--152.
13
 
14
Foster, I. and Moreau, L., Eds. 2006. Proceedings of the International Provenance and Annotation Workshop (IPAW). Lecture Notes in Computer Science, Vol. 4145, Springer-Verlag, Berlin, Germany.
 
15
Galperin, M. Y. 2008. The molecular biology database collection: 2008 update. Nucl. Acids Res. 36, suppl_1, D2--4.
 
16
 
17
Geerts, F. and Van den Bussche, J. 2007. Relational completeness of query languages for annotated databases. In Proceedings of the 11th International Symposium on Database Programming Languages (DBPL), Vienna, Austria, Lecture Notes in Computer Science, Vol. 4797, Springer, Berlin, Germany, 127--137.
18
 
19
 
20
21
22
 
23
 
24
Tan, W. 2003. Containment of relational queries with annotation propagation. In Proceedings of the 9th International Workshop on Database Programming Languages (DBPL). Lecture Notes in Computer Science, vol. 2921. Springer, Berlin, Germany, 37--53.
 
25
Tan, W. 2004. Research problems in data provenance. IEEE Data Eng. Bull. 27, 4, 45--52.
 
26
 
27
 
28
 
29
 
30
 
31


Collaborative Colleagues:
Peter Buneman: colleagues
James Cheney: colleagues
Stijn Vansummeren: colleagues