|
ABSTRACT
Business processes drive the operations of an enterprise. In the past, the focus was primarily on business process design, modeling, and automation. Recently, enterprises have realized that they can benefit tremendously from analyzing the behavior of their business processes with the objective of optimizing or improving them. In our research, we address the problem of warehousing business process execution data so that we can analyze their behavior using the analytic and reporting tools that are available in data warehouse environments. We build upon our previous work that described the design and implementation of a generic process data warehouse for use with any business processes. In this paper, we show how to automate the population of the generic process warehouse by tracking business events from an application environment. Typically, the source data consists of event streams that indicate changes in the business process state (i.e., progression of the process). The target schema is designed to allow querying of task and process execution data. The core of our approach for processing progression data relies on the construction of generic templates that specify the semantics of the event streams extraction and the subsequent transformations that translate the underlying IT events into business data changes. Using this extensible template mechanism, we show how to automate the construction of mappings to populate the generic process warehouse using two-levels of mappings that are applied in two-phases. Interestingly, our approach of using ETL technology for warehousing process data can be seen the other way around. An arbitrary ETL process can be modeled as a business process. Hence, we describe the benefit of modeling ETL as a business process and illustrate how to use our approach to warehouse ETL execution data, and to monitor and analyze the progress of ETL processes. Finally, we discuss implementation issues.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
BPI. HP Business Process Insight software. Available at: http://www.managementsoftware.hp.com/products/bpi, 2008.
|
| |
2
|
F. Casati, M. Castellanos, U. Dayal, M. C. Shan. A Metric Definition, Computation & Reporting Model for Business Operations Analysis. In EDBT, 2006.
|
| |
3
|
|
| |
4
|
|
| |
5
|
Jens-Peter Dittrich , Bernhard Seeger , David Scot Taylor , Peter Widmayer, Progressive merge join: a generic and non-blocking sort-based join algorithm, Proceedings of the 28th international conference on Very Large Data Bases, p.299-310, August 20-23, 2002, Hong Kong, China
|
| |
6
|
Filenet. IBM FileNet Business Activity Monitor. Available at: http://www-01.ibm.com/software/data/content-management/filenet-business-activity-monitor/
|
 |
7
|
Laura M. Haas , Mauricio A. Hernández , Howard Ho , Lucian Popa , Mary Roth, Clio grows up: from research prototype to industrial tool, Proceedings of the 2005 ACM SIGMOD international conference on Management of data, June 14-16, 2005, Baltimore, Maryland
[doi> 10.1145/1066157.1066252]
|
| |
8
|
|
| |
9
|
Kettle. Pentaho Data Integration. Available at: http://kettle.pentaho.org/, 2008.
|
| |
10
|
Ralph Kimball , Laura Reeves , Warren Thornthwaite , Margy Ross , Warren Thornwaite, The Data Warehouse Lifecycle Toolkit: Expert Methods for Designing, Developing and Deploying Data Warehouses with CD Rom, John Wiley & Sons, Inc., New York, NY, 1998
|
| |
11
|
Mondrian. Pentaho's Mondrian Project. Available at: http://mondrian.pentaho.org/, 2008.
|
| |
12
|
Openadaptor. Available at: https://www.openadaptor.org/, 2008.
|
| |
13
|
N. Polyzotis, S. Skiadopoulos, P. Vassiliadis, A. Simitsis, N.-E. Frantzell. Supporting Streaming Updates in an Active Data Warehouse. In ICDE, pp. 476--485, 2007.
|
| |
14
|
|
| |
15
|
|
| |
16
|
|
 |
17
|
|
| |
18
|
|
 |
19
|
Yufei Tao , Man Lung Yiu , Dimitris Papadias , Marios Hadjieleftheriou , Nikos Mamoulis, RPJ: producing fast join results on streams through rate-based optimization, Proceedings of the 2005 ACM SIGMOD international conference on Management of data, June 14-16, 2005, Baltimore, Maryland
[doi> 10.1145/1066157.1066200]
|
| |
20
|
C. Thomsen, T. B. Pedersen, W. Lehner. RiTE: Providing On-Demand Data for Right-Time Data Warehousing. In ICDE, pp. 456--465, 2008.
|
 |
21
|
|
| |
22
|
P. Vassiliadis, A. Simitsis. Near Real Time ETL. In Springer journal Annals of Information Systems, Volume 3, Special issue on "New Trends in Data Warehousing and Data Analysis", 2008.
|
| |
23
|
Velocity. The Apache Velocity Project. Available at: http://velocity.apache.org/, 2008.
|
| |
24
|
T. Urhan, M. J. Franklin: XJoin: A Reactively-Scheduled Pipelined Join Operator. In IEEE Data Eng. Bull. 23(2), pp. 27--33, 2000.
|
| |
25
|
WS-BPEL Version 2.0, Oasis. Available at: http://docs.oasis-open.org/wsbpel/2.0/wsbpel-v2.0.pdf
|
|