ACM Home Page
Please provide us with feedback. Feedback
Range queries in OLAP data cubes
Full text PdfPdf (1.91 MB)
Source International Conference on Management of Data archive
Proceedings of the 1997 ACM SIGMOD international conference on Management of data table of contents
Tucson, Arizona, United States
Pages: 73 - 88  
Year of Publication: 1997
ISBN:0-89791-911-4
Also published in ...
Authors
Ching-Tien Ho  IBM Almaden Research Center, 650 Harry Road, San Jose, CA
Rakesh Agrawal  IBM Almaden Research Center, 650 Harry Road, San Jose, CA
Nimrod Megiddo  IBM Almaden Research Center, 650 Harry Road, San Jose, CA
Ramakrishnan Srikant  IBM Almaden Research Center, 650 Harry Road, San Jose, CA
Sponsor
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 113,   Citation Count: 68
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/253260.253274
What is a DOI?

ABSTRACT

A range query applies an aggregation operation over all selected cells of an OLAP data cube where the selection is specified by providing ranges of values for numeric dimensions. We present fast algorithms for range queries for two types of aggregation operations: SUM and MAX. These two operations cover techniques required for most popular aggregation operations, such as those supported by SQL. For range-sum queries, the essential idea is to precompute some auxiliary information (prefix sums) that is used to answer ad hoc queries at run-time. By maintaining auxiliary information which is of the same size as the data cube, all range queries for a given cube can be answered in constant time, irrespective of the size of the sub-cube circumscribed by a query. Alternatively, one can keep auxiliary information which is 1/bd of the size of the d-dimensional data cube. Response to a range query may now require access to some cells of the data cube in addition to the access to the auxiliary information, but the overall time complexity is typically reduced significantly. We also discuss how the precomputed information is incrementally updated by batching updates to the data cube. Finally, we present algorithms for choosing the subset of the data cube dimensions for which the auxiliary information is computed and the blocking factor to use for each such subset. Our approach to answering range-max queries is based on precomputed max over balanced hierarchical tree structures. We use a branch-and-bound-like procedure to speed up the finding of max in a region. We also show that with a branch-and-bound procedure, the average-case complexity is much smaller than the worst-case complexity.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
AAD+96
 
AGS97
Ben80
BF79
BKSS90
 
BR91
M. Berger and I. Regoutsos. An algorithm point clustering and grid generation. IEEE Transactions on Systems, Man and Cybernetics, 21(5):1278-86, 1991.
Cha90
 
CLS86
G.D. Cohen, A.C. Lobstein, and N.j.A. Sloane. b-k~her results on the covering radius of codes. IEJEB Trans. Information Theory, IT-32(5):680- 694, September 1986.
 
CM89
 
Cod93
E.F. Codd. Providing OLAP (on-line analytical processing) to user-analysts: An IT mandate. Technical report, E. F. Codd and Associates, 1993.
Col96
Com79
CR89
 
CS94
 
GBLP96
 
GHQ95
 
GHRU97
HBA97
HRU96
 
IBM95
IBM. DB~ SQL Reference }or Common Servers Version ~, 1995.
 
JD88
 
JS96
T. Johnson and D. Shasha. Hierarchically sprit cube forests for decision support: description and tuned design, 1996. Working Paper.
 
Lom95
D. Lomet, editor. Special Issue on Materialized Views and Data Warehousing. IEEE Data Engineering Bulletin, 18(2), June 1995.
 
Meh84
 
Mic92
 
Mit70
L. Mitten. Branch and bound methods: General formulation and properties. Operations Research, 18:24-34, 1970.
 
OLA96
The CLAP Council. MD-API the OLAP Applzcation Program Interface Version 0.5 Specification, September 1996.
 
RND77
E.M. Reingold, J. Nievergelt, and N. Dec. Combinatorial Algorithms. Prentice-Hall, Englewood Cliffs. N J, 1977.
 
Sam89
 
SAM96
 
SB95
P. Schroeter and j. Bigun. Hierarchical image segmentation by multi-dimensional clustering and orientation-adaptive boundary refinement. Pattern Recognition, 25(5):695-709, May 1995.
 
SDNR96
 
SR96
B. Salzberg and A. Reuter. Indexing for aggregation, 1996. Working Paper.
 
STL89
Vai85
WL85
 
Yao85
Andrew Yao. On the complexity of maintaining partial sums. SIAM J. Computing, 14(2):277- 288, May 1985.
 
YL95

CITED BY  68

Collaborative Colleagues:
Ching-Tien Ho: colleagues
Rakesh Agrawal: colleagues
Nimrod Megiddo: colleagues
Ramakrishnan Srikant: colleagues