|
ABSTRACT
This work addresses data warehouse maintenance, i.e. how changes to autonomous, heterogeneous, and distributed sources should be detected and propagated to a warehouse. The research community has mainly addressed issues relating to the internal operation of data warehouse servers. Work related to data warehouse maintenance has received less attention and only a limited set of maintenance alternatives are considered while ignoring the autonomy and heterogeneity of sources.In this paper, we extend work on single source view maintenance to views with multiple heterogeneous sources. We present a tool (PAM) which allows for comparison of a large number of relevant maintenance policies under different configurations. Based on such analysis and previous studies we propose a set of heuristics to guide in policy selection. The quality of these heuristics is evaluated empirically using a test-bed developed for this purpose. This is done for a number of different criteria and for different data sources and computer systems. The performance gained using the policy selected through the heuristics is compared with the performance of all identified policies. Based on these experiments we claim that heuristic-based selections are good.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
D. Agrawal , A. El Abbadi , A. Singh , T. Yurek, Efficient view maintenance at data warehouses, Proceedings of the 1997 ACM SIGMOD international conference on Management of data, p.417-427, May 11-15, 1997, Tucson, Arizona, United States
|
| |
2
|
|
| |
3
|
|
 |
4
|
Latha S. Colby , Akira Kawaguchi , Daniel F. Lieuwen , Inderpal Singh Mumick , Kenneth A. Ross, Supporting multiple view maintenance policies, Proceedings of the 1997 ACM SIGMOD international conference on Management of data, p.405-416, May 11-15, 1997, Tucson, Arizona, United States
|
| |
5
|
|
| |
6
|
H. Engström, S. Chakravarthy, and B. Lings. Data integration in heterogeneous environments: Multi-source policies, cost model, and implementation. Technical report, University of Skövde, Sweden, 2002.
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
A. Gupta and I. S. Mumick. Maintenance of materialized views: Problems, techniques, and applications. IEEE Data Engineering Bulletin, 18(2):3--18, 1995.
|
| |
11
|
J. Hammer, H. Garcia-Molina, J. Widom, W. Labio, and Y. Zhuge. The Stanford data warehousing project. IEEE Data Engineering Bulletin, 18(2):41--48, 1995.
|
 |
12
|
|
 |
13
|
Venky Harinarayan , Anand Rajaraman , Jeffrey D. Ullman, Implementing data cubes efficiently, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.205-216, June 04-06, 1996, Montreal, Quebec, Canada
|
 |
14
|
|
 |
15
|
|
| |
16
|
M. Lee and J. Hammer. Speeding up warehouse physical design using a randomized algorithm. DMDW Workshop, 1999.
|
| |
17
|
Dallan Quass , Ashish Gupta , Inderpal Singh Mumick , Jennifer Widom, Making views self-maintainable for data warehousing, Proceedings of the fourth international conference on on Parallel and distributed information systems, p.158-169, December 18-20, 1996, Miami Beach, Florida, United States
|
| |
18
|
|
| |
19
|
|
 |
20
|
|
| |
21
|
|
 |
22
|
Hui Wang , Maria Orlowska , Weifa Liang, Efficient refreshment of materialized views with multiple sources, Proceedings of the eighth international conference on Information and knowledge management, p.375-382, November 02-06, 1999, Kansas City, Missouri, United States
[doi> 10.1145/319950.320030]
|
| |
23
|
|
| |
24
|
|
 |
25
|
Yue Zhuge , Héctor García-Molina , Joachim Hammer , Jennifer Widom, View maintenance in a warehousing environment, Proceedings of the 1995 ACM SIGMOD international conference on Management of data, p.316-327, May 22-25, 1995, San Jose, California, United States
|
| |
26
|
Yue Zhuge , Hector Garcia-Molina , Janet L. Wiener, The Strobe algorithms for multi-source warehouse consistency, Proceedings of the fourth international conference on on Parallel and distributed information systems, p.146-157, December 18-20, 1996, Miami Beach, Florida, United States
|
|