ACM Home Page
Please provide us with feedback. Feedback
Real-time data warehouse loading methodology
Full text PdfPdf (416 KB)
Source
ACM International Conference Proceeding Series; Vol. 299 archive
Proceedings of the 2008 international symposium on Database engineering & applications table of contents
Coimbra, Portugal
SESSION: Real-time database systems table of contents
Pages 49-58  
Year of Publication: 2008
ISBN:978-1-60558-188-0
Authors
Ricardo Jorge Santos  University of Coimbra, Coimbra, Portugal
Jorge Bernardino  Superior Institute of Engineering of Coimbra, Coimbra, Portugal
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 84,   Downloads (12 Months): 565,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1451940.1451949
What is a DOI?

ABSTRACT

A data warehouse provides information for analytical processing, decision making and data mining tools. As the concept of real-time enterprise evolves, the synchronism between transactional data and data warehouses, statically implemented, has been redefined. Traditional data warehouse systems have static structures of their schemas and relationships between data, and therefore are not able to support any dynamics in their structure and content. Their data is only periodically updated because they are not prepared for continuous data integration. For real-time enterprises with needs in decision support purposes, real-time data warehouses seem to be very promising. In this paper we present a methodology on how to adapt data warehouse schemas and user-end OLAP queries for efficiently supporting real-time data integration. To accomplish this, we use techniques such as table structure replication and query predicate restrictions for selecting data, to enable continuously loading data in the data warehouse with minimum impact in query execution time. We demonstrate the efficiency of the method by analyzing its impact in query performance using benchmark TPC-H executing query workloads while simultaneously performing continuous data integration at various insertion time rates.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
 
3
T. Binder, 2003. Gong User Manual, Tecco Software Entwicklung AG.
 
4
M. Bouzeghoub, F. Fabret, and M. Matulovic, 1999. "Modeling Data Warehouse Refreshment Process as a Workflow Application", Intern. Workshop on Design and Management of Data Warehouses (DMDW).
 
5
 
6
7
 
8
 
9
10
 
11
 
12
R. Kimball, and J. Caserta, 2004. The Data Warehouse ETL Toolkit, Wiley Computer Publishing.
 
13
 
14
 
15
D. Lomet, and J. Gehrke, 2003. Special Issue on Data Stream Processing, IEEE Data Eng. Bulletin, 26(1).
 
16
Oracle Corporation, 2005. www.oracle.com
 
17
T. B. Pedersen, 2004. "How is BI Used in Industry?", Int. Conf. on Data Warehousing and Knowledge Discovery (DAWAK).
 
18
 
19
20
 
21
D. Theodoratus, and M. Bouzeghoub, 1999. "Data Currency Quality Factors in Data Warehouse Design", International Workshop on the Design and Management of Data Warehouses (DMDW).
 
22
TPC-H decision support benchmark, Transaction Processing Council, www.tpc.com.
 
23
 
24
C. White, 2002. "Intelligent Business Strategies: Real-Time Data Warehousing Heats Up", DM Preview, www.dmreview.com/article_sub_cfm?articleId=5570.
 
25
 
26
 
27
 
28

Collaborative Colleagues:
Ricardo Jorge Santos: colleagues
Jorge Bernardino: colleagues