ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Automating the loading of business process data warehouses
Full text PdfPdf (629 KB)
Source Extending Database Technology; Vol. 360 archive
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology table of contents
Saint Petersburg, Russia
SESSION: Research sessions: System architectures table of contents
Pages: 612-623  
Year of Publication: 2009
ISBN:978-1-60558-422-5
Authors
Malu Castellanos  HP Labs, Palo Alto, Ca
Alkis Simitsis  HP Labs, Palo Alto, Ca
Kevin Wilkinson  HP Labs, Palo Alto, Ca
Umeshwar Dayal  HP Labs, Palo Alto, Ca
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 27,   Downloads (12 Months): 369,   Citation Count: 1
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1516360.1516431
What is a DOI?

ABSTRACT

Business processes drive the operations of an enterprise. In the past, the focus was primarily on business process design, modeling, and automation. Recently, enterprises have realized that they can benefit tremendously from analyzing the behavior of their business processes with the objective of optimizing or improving them. In our research, we address the problem of warehousing business process execution data so that we can analyze their behavior using the analytic and reporting tools that are available in data warehouse environments. We build upon our previous work that described the design and implementation of a generic process data warehouse for use with any business processes. In this paper, we show how to automate the population of the generic process warehouse by tracking business events from an application environment. Typically, the source data consists of event streams that indicate changes in the business process state (i.e., progression of the process). The target schema is designed to allow querying of task and process execution data. The core of our approach for processing progression data relies on the construction of generic templates that specify the semantics of the event streams extraction and the subsequent transformations that translate the underlying IT events into business data changes. Using this extensible template mechanism, we show how to automate the construction of mappings to populate the generic process warehouse using two-levels of mappings that are applied in two-phases. Interestingly, our approach of using ETL technology for warehousing process data can be seen the other way around. An arbitrary ETL process can be modeled as a business process. Hence, we describe the benefit of modeling ETL as a business process and illustrate how to use our approach to warehouse ETL execution data, and to monitor and analyze the progress of ETL processes. Finally, we discuss implementation issues.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
BPI. HP Business Process Insight software. Available at: http://www.managementsoftware.hp.com/products/bpi, 2008.
 
2
F. Casati, M. Castellanos, U. Dayal, M. C. Shan. A Metric Definition, Computation & Reporting Model for Business Operations Analysis. In EDBT, 2006.
 
3
 
4
 
5
 
6
Filenet. IBM FileNet Business Activity Monitor. Available at: http://www-01.ibm.com/software/data/content-management/filenet-business-activity-monitor/
7
 
8
 
9
Kettle. Pentaho Data Integration. Available at: http://kettle.pentaho.org/, 2008.
 
10
 
11
Mondrian. Pentaho's Mondrian Project. Available at: http://mondrian.pentaho.org/, 2008.
 
12
Openadaptor. Available at: https://www.openadaptor.org/, 2008.
 
13
N. Polyzotis, S. Skiadopoulos, P. Vassiliadis, A. Simitsis, N.-E. Frantzell. Supporting Streaming Updates in an Active Data Warehouse. In ICDE, pp. 476--485, 2007.
 
14
 
15
 
16
17
 
18
19
 
20
C. Thomsen, T. B. Pedersen, W. Lehner. RiTE: Providing On-Demand Data for Right-Time Data Warehousing. In ICDE, pp. 456--465, 2008.
21
 
22
P. Vassiliadis, A. Simitsis. Near Real Time ETL. In Springer journal Annals of Information Systems, Volume 3, Special issue on "New Trends in Data Warehousing and Data Analysis", 2008.
 
23
Velocity. The Apache Velocity Project. Available at: http://velocity.apache.org/, 2008.
 
24
T. Urhan, M. J. Franklin: XJoin: A Reactively-Scheduled Pipelined Join Operator. In IEEE Data Eng. Bull. 23(2), pp. 27--33, 2000.
 
25
WS-BPEL Version 2.0, Oasis. Available at: http://docs.oasis-open.org/wsbpel/2.0/wsbpel-v2.0.pdf

Collaborative Colleagues:
Malu Castellanos: colleagues
Alkis Simitsis: colleagues
Kevin Wilkinson: colleagues
Umeshwar Dayal: colleagues