| Detecting and resolving unsound workflow views for correct provenance analysis |
| Full text |
Pdf
(958 KB)
|
Source
|
International Conference on Management of Data
archive
Proceedings of the 35th SIGMOD international conference on Management of data
table of contents
Providence, Rhode Island, USA
SESSION: Research session 14: understanding data and queries
table of contents
Pages 549-562
Year of Publication: 2009
ISBN:978-1-60558-551-2
|
|
Authors
|
|
Peng Sun
|
Arizona State University, Tempe, AZ, USA
|
|
Ziyang Liu
|
Arizona State University, Tempe, AZ, USA
|
|
Susan B. Davidson
|
University of Pennsylvania, Philadelphia, PA, USA
|
|
Yi Chen
|
Arizona State University, Tempe, AZ, USA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 45, Downloads (12 Months): 154, Citation Count: 0
|
|
|
ABSTRACT
Workflow views abstract groups of tasks in a workflow into high level composite tasks, in order to reuse sub-workflows and facilitate provenance analysis. However, unless a view is carefully designed, it may not preserve the dataflow between tasks in the workflow, i.e., it may not be sound. Unsound views can be misleading and cause incorrect provenance analysis. This paper studies the problem of efficiently identifying and correcting unsound workflow views with minimal changes. In particular, given a workflow view, we wish to split each unsound composite task into the minimal number of tasks, such that the resulting view is sound. We prove that this problem is NP-hard by reduction from independent set. We then propose two local optimality conditions (weak and strong), and design polynomial time algorithms for correcting unsound views to meet these conditions. Experiments show that our proposed algorithms are effective and efficient, and that the strong local optimality algorithm produces better solutions than the weak local optimality algorithm with little processing overhead.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
myExperiment. http://www.myexperiment.org/Workflows.
|
| |
2
|
Open provenance model, 2008.
|
 |
3
|
Martín Abadi , Anindya Banerjee , Nevin Heintze , Jon G. Riecke, A core calculus of dependency, Proceedings of the 26th ACM SIGPLAN-SIGACT symposium on Principles of programming languages, p.147-160, January 20-22, 1999, San Antonio, Texas, United States
[doi> 10.1145/292540.292555]
|
| |
4
|
Ilkay Altintas , Chad Berkley , Efrat Jaeger , Matthew Jones , Bertram Ludascher , Steve Mock, Kepler: An Extensible System for Design and Execution of Scientific Workflows, Proceedings of the 16th International Conference on Scientific and Statistical Database Management, p.423, June 21-23, 2004
[doi> 10.1109/SSDBM.2004.44]
|
| |
5
|
|
 |
6
|
|
| |
7
|
|
 |
8
|
|
| |
9
|
J. Cheney. Program Slicing and Data Provenance. In IEEE Data Eng. Bull., volume 30(4), pages 22--28, 2007.
|
| |
10
|
J. Cheney, A. Ahmed, and U. A. Acar. Provenance as Dependency Analysis. In Database Programming Languages, pages 138--152, 2007.
|
| |
11
|
Dickson K. W. Chiu , S. C. Cheung , Sven Till , Kamalakar Karlapalem , Qing Li , Eleanna Kafeza, Workflow View Driven Cross-Organizational Interoperability in a Web Service Environment, Information Technology and Management, v.5 n.3-4, p.221-250, July-October 2004
[doi> 10.1023/B:ITEM.0000031580.57966.d4]
|
| |
12
|
S. Cohen, S. C. Boulakia, and S. B. Davidson. Towards a Model of Provenance and User Views in Scientific Workflows. In DILS, 2006.
|
| |
13
|
|
 |
14
|
|
| |
15
|
|
 |
16
|
|
| |
17
|
R. B. A. Kamel Barkaoui and Z. Sbai. Workflow Soundness Verification based on Structure Theory of Petri Nets. In International Journal of Computing and Information Sciences, 2007.
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
Tom Oinn , Matthew Addis , Justin Ferris , Darren Marvin , Martin Senger , Mark Greenwood , Tim Carver , Kevin Glover , Matthew R. Pocock , Anil Wipat , Peter Li, Taverna: a tool for the composition and enactment of bioinformatics workflows, Bioinformatics, v.20 n.17, p.3045-3054, November 2004
[doi> 10.1093/bioinformatics/bth361]
|
 |
22
|
Shrija Rajbhandari , Omer F. Rana , Ian Wootten, A fuzzy model for calculating workflow trust using provenance data, Proceedings of the 15th ACM Mardi Gras conference: From lightweight mash-ups to lambda grids: Understanding the spectrum of distributed computing requirements, applications, tools, infrastructures, interoperability, and the incremental adoption of key capabilities, January 29-February 03, 2008, Baton Rouge, Louisiana
[doi> 10.1145/1341811.1341823]
|
| |
23
|
M. R. Ralph Bobrik and T. Bauer. View-Based Process Visualization. In Business Process Management, pages 88--95, 2007.
|
| |
24
|
|
| |
25
|
|
| |
26
|
|
 |
27
|
Qihong Shao , Yi Chen , Shu Tao , Xifeng Yan , Nikos Anerousis, Efficient ticket routing by resolution sequence mining, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
[doi> 10.1145/1401890.1401964]
|
| |
28
|
|
| |
29
|
|
| |
30
|
J. Siegeris and A. Zimmermann. Workflow Model Compositions Preserving Relaxed Soundness. In Business Process Management, pages 177--192, 2006.
|
| |
31
|
|
| |
32
|
|
| |
33
|
|
| |
34
|
|
|