ACM Home Page
Please provide us with feedback. Feedback
Annotation propagation revisited for key preserving views
Full text PdfPdf (367 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the 15th ACM international conference on Information and knowledge management table of contents
Arlington, Virginia, USA
SESSION: Query processing and optimization table of contents
Pages: 632 - 641  
Year of Publication: 2006
ISBN:1-59593-433-2
Authors
Gao Cong  University of Edinburgh
Wenfei Fan  University of Edinburgh and Bell Laboratories
Floris Geerts  University of Edinburgh, Hasselt University and Transnational University of Limburg
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 41,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1183614.1183705
What is a DOI?

ABSTRACT

This paper revisits the analysis of annotation propagation from source databases to views defined in terms of conjunctive (SPJ) queries. Given a source database D, an SPJ query Q, the view Q(D) and a tuple ΔV in the view, the view (resp. source) side-effect problem is to find a minimal set ΔD of tuples such that the deletion of ΔD from D results in the deletion of ΔV from Q(D) while minimizing the side effects on the view (resp. the source). A third problem, referred to as the annotation placement problem, is to find a single base tuple ΔD such that annotation in a field of ΔD propagates to ΔV while minimizing the propagation to other fields in the view Q(D). These are important for data provenance and the management of view updates. However important, these problems are unfortunately NP-hard for most subclasses of SPJ views [5].To make the annotation propagation analysis feasible in practice, we propose a key preserving condition on SPJ views, which requires that the projection fields of an SPJ view Q retain a key of each base relation involved in Q. While this condition is less restrictive than other proposals [11, 14], it often simplifies the annotation propagation analysis. Indeed, for key-preserving SPJ views the annotation placement problem coincides with the view side-effect problem, and the view and source side-effect problems become tractable. In addition we generalize the setting of [5] by allowing ΔV to be a group of tuples to be deleted, and investigate the insertion of tuples to the view. We show that group updates make the analysis harder: these problems become NP-hard for several subclasses of SPJ views. We also show that for SPJ views the source and view side-effect problems are NP-hard for single-tuple insertion, but are tractable for some subclasses of SPJ for group insertions, in the presence or in the absence of the key preservation condition.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
3
4
5
 
6
 
7
B. Choi, G. Cong, W. Fan, and S. Viglas. Updating recursive XML views of relations. Submitted for publication, 2006.
8
 
9
Y. Cui and J. Widom. Run-time translation of view tuple deletions using data lineage. Technical Report, Standford University, 2001.
10
11
 
12
 
13
IBM. IBM DB2 Universal Database SQL Reference. http://www.ibm.com/software/data/db2/.
14
15
 
16
Oracle. SQL Reference. http://www.oracle.com/technology/documentation/database10g.html.
 
17
E. Rahm and H. H. Do. Data cleaning: Problems and current approaches. IEEE Data Engineering Bulletin, 23(4), 2000.
 
18
SQL server. MSDN Library. http://msdn.microsoft.com/library.
 
19
W. C. Tan. Containment of relational queries with annotation propagation. In DBPL, 2003.
 
20
J. Widom. Trio: A system for integrated management of data, accuracy, and lineage. In CIDR, 2005.


Collaborative Colleagues:
Gao Cong: colleagues
Wenfei Fan: colleagues
Floris Geerts: colleagues