ACM Home Page
Please provide us with feedback. Feedback
CubiST++: Evaluating Ad-Hoc CUBE Queries Using Statistics Trees
Full text Publisher SitePublisher Site
Source Distributed and Parallel Databases archive
Volume 14 ,  Issue 3  (November 2003) table of contents
Pages: 221 - 254  
Year of Publication: 2003
ISSN:0926-8782
Authors
Joachim Hammer  Computer & Information Science & Eng., University of Florida, Gainesville, FL 32611-6120, USA. jhammer@cise.ufl.edu
Lixin Fu  Division of Computer Science, University of North Carolina, Greensboro, Greensboro, NC 27402-6170, USA. lfu@uncg.edu
Publisher
Kluwer Academic Publishers  Hingham, MA, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: 10.1023/A:1025537315785

ABSTRACT

We report on a new, efficient encoding for the data cube, which results in a drastic speed-up of OLAP queries that aggregate along any combination of dimensions over numerical and categorical attributes. We are focusing on a class of queries called cube queries, which return aggregated values rather than sets of tuples. Our approach, termed CubiST++ (Cubing with Statistics Trees Plus Families), represents a drastic departure from existing relational (ROLAP) and multi-dimensional (MOLAP) approaches in that it does not use the view lattice to compute and materialize new views from existing views in some heuristic fashion. Instead, CubiST++ encodes all possible aggregate views in the leaves of a new data structure called statistics tree (ST) during a one-time scan of the detailed data. In order to optimize the queries involving constraints on hierarchy levels of the underlying dimensions, we select and materialize a family of candidate trees, which represent superviews over the different hierarchical levels of the dimensions. Given a query, our query evaluation algorithm selects the smallest tree in the family, which can provide the answer. Extensive evaluations of our prototype implementation have demonstrated its superior run-time performance and scalability when compared with existing MOLAP and ROLAP systems.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
3. Arbor Systems, "Large-scale data warehousing using hyperion essbase OLAP technology," Arbor Systems, White Paper 1997.
4
5
6
7
 
8
8. E.F. Codd, S.B. Codd, and C.T. Salley, "Beyond decision support," in Computer World, vol. 27, 1993, www.arborsoft.com/OLAP.html.
 
9
9. E.F. Codd, S.B. Codd, and C.T. Salley, "Providing OLAP (on-line analytical processing) to user-analysts: An IT mandate," Technical Report 1993.
10
 
11
12
 
13
 
14
 
15
 
16
16. H. Gupta and I. Mumick, "Selection of views to materialize under a maintenance cost constraint," Stanford University, Technical Report, 1997.
17
 
18
18. Information Advantage, "Business intelligence," White Paper, 1998.
 
19
19. Informix Corp., "Informix red brick decision server," 2001, http://www.informix.com/redbrick/.
 
20
20. T. Johnson and D. Shasha, "Some approaches to index design for cube forests," Bulletin of the Technical Committee on Data Engineering, IEEE Computer Society, vol. 20, pp. 27-35, 1997.
 
21
 
22
22. M. Lee and J. Hammer, "Speeding up warehouse physical design using a randomized algorithm," International Journal of Cooperative information Systems (IJCIS)--Special Issue on Design and Management of Data Warehouses, vol. 10, pp. 327-354, 2001.
 
23
23. D. Lomet, "Bulletin of the technical committee on data engineering," in Special Issue on Materialized Views and Data Warehousing, J. Widom (Ed.), IEEE Computer Society, 1995, vol. 18.
 
24
24. Microsoft Corp., "Microsoft SQL server," Microsoft, Seattle, WA, White Paper.
 
25
25. MicroStrategy Inc., "The case for relational OLAP," MicroStrategy, White Paper.
26
 
27
 
28
28. Oracle Corp., "Oracle express OLAP technology," http://www.oracle.com/olap/index.html.
 
29
29. Oracle Corp., "Oracle express server documentation," Oracle Corporation, Redwood Shores, CA, Documentation, August 2001.
 
30
30. Pilot Software Inc., "An introduction to OLAP multidimensional terminology and technology," Pilot Software, Cambridge, MA, White Paper.
 
31
31. Redbrick Systems, "Decision-makers, business data and RISQL," Informix, Los Gatos, CA, White Paper, 1997.
32
33
34
 
35
 
36
36. Transaction Processing Performance Council, "The TPC Benchmark<sup>TM</sup>H," Transaction Processing Council, 2001, http://www.tpc.org/tpch/.
 
37
37. Transaction Processing Performance Council. "Transaction processing performance council," 2001, http://www.tpc.org/.
 
38
39
 
40

Collaborative Colleagues:
Joachim Hammer: colleagues
Lixin Fu: colleagues