ACM Home Page
Please provide us with feedback. Feedback
Efficient computation of Iceberg cubes with complex measures
Full text PdfPdf (400 KB)
Source International Conference on Management of Data archive
Proceedings of the 2001 ACM SIGMOD international conference on Management of data table of contents
Santa Barbara, California, United States
Pages: 1 - 12  
Year of Publication: 2001
ISBN:1-58113-332-4
Also published in ...
Authors
Jiawei Han  School of Computing Science, Simon Fraser University, B.C., Canada
Jian Pei  School of Computing Science, Simon Fraser University, B.C., Canada
Guozhu Dong  Department of Computer Science, Wright State University, Dayton, OH
Ke Wang  School of Computing Science, Simon Fraser University, B.C., Canada
Sponsor
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 44
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/375663.375664
What is a DOI?

ABSTRACT

It is often too expensive to compute and materialize a complete high-dimensional data cube. Computing an iceberg cube, which contains only aggregates above certain thresholds, is an effective way to derive nontrivial multi-dimensional aggregations for OLAP and data mining.

In this paper, we study efficient methods for computing iceberg cubes with some popularly used complex measures, such as average, and develop a methodology that adopts a weaker but anti-monotonic condition for testing and pruning search space. In particular, for efficient computation of iceberg cubes with the average measure, we propose a top-k average pruning method and extend two previously studied methods, Apriori and BUC, to Top-k Apriori and Top-k BUC. To further improve the performance, an interesting hypertree structure, called H-tree, is designed and a new iceberg cubing method, called Top-k H-Cubing, is developed. Our performance study shows that Top-k BUC and Top-k H-Cubing are two promising candidates for scalable computation, and Top-k H-Cubing has better performance in most cases.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
R. J. Bayardo, R. Agrawal, and D. Gunopulos. Constraint-based rule mining on large, dense data sets. ICDE'99.
4
5
 
6
 
7
G. Grahne, L. Lakshmanan, and X. Wang. Efficient mining of constrained correlated sets. ICDE'00.
 
8
9
10
11
12
13
 
14
 
15
R. Srikant, Q. Vu, and R. Agrawal. Mining association rules with item constraints. KDD'97.
16

CITED BY  44

Collaborative Colleagues:
Jiawei Han: colleagues
Jian Pei: colleagues
Guozhu Dong: colleagues
Ke Wang: colleagues