|
ABSTRACT
We report on a new, efficient encoding for the data cube, which results in a drastic speed-up of OLAP queries that aggregate along any combination of dimensions over numerical and categorical attributes. We are focusing on a class of queries called cube queries, which return aggregated values rather than sets of tuples. Our approach, termed CubiST++ (Cubing with Statistics Trees Plus Families), represents a drastic departure from existing relational (ROLAP) and multi-dimensional (MOLAP) approaches in that it does not use the view lattice to compute and materialize new views from existing views in some heuristic fashion. Instead, CubiST++ encodes all possible aggregate views in the leaves of a new data structure called statistics tree (ST) during a one-time scan of the detailed data. In order to optimize the queries involving constraints on hierarchy levels of the underlying dimensions, we select and materialize a family of candidate trees, which represent superviews over the different hierarchical levels of the dimensions. Given a query, our query evaluation algorithm selects the smallest tree in the family, which can provide the answer. Extensive evaluations of our prototype implementation have demonstrated its superior run-time performance and scalability when compared with existing MOLAP and ROLAP systems.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Sameet Agarwal , Rakesh Agrawal , Prasad Deshpande , Ashish Gupta , Jeffrey F. Naughton , Raghu Ramakrishnan , Sunita Sarawagi, On the Computation of Multidimensional Aggregates, Proceedings of the 22th International Conference on Very Large Data Bases, p.506-521, September 03-06, 1996
|
| |
2
|
|
| |
3
|
3. Arbor Systems, "Large-scale data warehousing using hyperion essbase OLAP technology," Arbor Systems, White Paper 1997.
|
 |
4
|
|
 |
5
|
|
 |
6
|
|
 |
7
|
|
| |
8
|
8. E.F. Codd, S.B. Codd, and C.T. Salley, "Beyond decision support," in Computer World, vol. 27, 1993, www.arborsoft.com/OLAP.html.
|
| |
9
|
9. E.F. Codd, S.B. Codd, and C.T. Salley, "Providing OLAP (on-line analytical processing) to user-analysts: An IT mandate," Technical Report 1993.
|
 |
10
|
|
| |
11
|
|
 |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
Jim Gray , Surajit Chaudhuri , Adam Bosworth , Andrew Layman , Don Reichart , Murali Venkatrao , Frank Pellow , Hamid Pirahesh, Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals, Data Mining and Knowledge Discovery, v.1 n.1, p.29-53, 1997
[doi> 10.1023/A:1009726021843]
|
| |
16
|
16. H. Gupta and I. Mumick, "Selection of views to materialize under a maintenance cost constraint," Stanford University, Technical Report, 1997.
|
 |
17
|
|
| |
18
|
18. Information Advantage, "Business intelligence," White Paper, 1998.
|
| |
19
|
19. Informix Corp., "Informix red brick decision server," 2001, http://www.informix.com/redbrick/.
|
| |
20
|
20. T. Johnson and D. Shasha, "Some approaches to index design for cube forests," Bulletin of the Technical Committee on Data Engineering, IEEE Computer Society, vol. 20, pp. 27-35, 1997.
|
| |
21
|
|
| |
22
|
22. M. Lee and J. Hammer, "Speeding up warehouse physical design using a randomized algorithm," International Journal of Cooperative information Systems (IJCIS)--Special Issue on Design and Management of Data Warehouses, vol. 10, pp. 327-354, 2001.
|
| |
23
|
23. D. Lomet, "Bulletin of the technical committee on data engineering," in Special Issue on Materialized Views and Data Warehousing, J. Widom (Ed.), IEEE Computer Society, 1995, vol. 18.
|
| |
24
|
24. Microsoft Corp., "Microsoft SQL server," Microsoft, Seattle, WA, White Paper.
|
| |
25
|
25. MicroStrategy Inc., "The case for relational OLAP," MicroStrategy, White Paper.
|
 |
26
|
|
| |
27
|
|
| |
28
|
28. Oracle Corp., "Oracle express OLAP technology," http://www.oracle.com/olap/index.html.
|
| |
29
|
29. Oracle Corp., "Oracle express server documentation," Oracle Corporation, Redwood Shores, CA, Documentation, August 2001.
|
| |
30
|
30. Pilot Software Inc., "An introduction to OLAP multidimensional terminology and technology," Pilot Software, Cambridge, MA, White Paper.
|
| |
31
|
31. Redbrick Systems, "Decision-makers, business data and RISQL," Informix, Los Gatos, CA, White Paper, 1997.
|
 |
32
|
|
 |
33
|
|
 |
34
|
|
| |
35
|
|
| |
36
|
36. Transaction Processing Performance Council, "The TPC Benchmark<sup>TM</sup>H," Transaction Processing Council, 2001, http://www.tpc.org/tpch/.
|
| |
37
|
37. Transaction Processing Performance Council. "Transaction processing performance council," 2001, http://www.tpc.org/.
|
| |
38
|
|
 |
39
|
|
| |
40
|
|
|