ACM Home Page
Please provide us with feedback. Feedback
Efficient closed pattern mining in the presence of tough block constraints
Full text PdfPdf (289 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Seattle, WA, USA
SESSION: Research track papers table of contents
Pages: 138 - 147  
Year of Publication: 2004
ISBN:1-58113-888-1
Authors
Krishna Gade  University of Minnesota, Minneapolis, MN
Jianyong Wang  University of Minnesota, Minneapolis, MN
George Karypis  University of Minnesota, Minneapolis, MN
Sponsors
SIGMOD: ACM Special Interest Group on Management of Data
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 50,   Citation Count: 10
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1014052.1014070
What is a DOI?

ABSTRACT

Various constrained frequent pattern mining problem formulations and associated algorithms have been developed that enable the user to specify various itemset-based constraints that better capture the underlying application requirements and characteristics. In this paper we introduce a new class of block constraints that determine the significance of an itemset pattern by considering the dense block that is formed by the pattern's items and its associated set of transactions. Block constraints provide a natural framework by which a number of important problems can be specified and make it possible to solve numerous problems on binary and real-valued datasets. However, developing computationally efficient algorithms to find these block constraints poses a number of challenges as unlike the different itemset-based constraints studied earlier, these block constraints are tough as they are neither anti-monotone, monotone, nor convertible. To overcome this problem, we introduce a new class of pruning methods that significantly reduce the overall search space and present a computationally efficient and scalable algorithm called CBMiner to find the closed itemsets that satisfy the block constraints.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
R. Agarwal, C. Aggarwal, V. Prasad, and V. Crestana. A tree projection algorithm for generation of large itemsets for association rules. IBM Research Report, RC21341, November 1998.
 
2
3
 
4
5
6
 
7
 
8
A. Grama, A. Gupta, G. Karypis, and V. Kumar. Introduction to Parallel Computing: Design and Analysis of Algorithms, 2nd Edition. Adison Wesley Publishing Company, 2003.
9
 
10
K. Gade, J. Wang, and G. Karypis. Effcient Closed Pattern Mining in the Presence of Tough Block Constraints. Technical Report TR #03--45, Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN, 2003. Available on the WWW at: http://cs.umn.edu/~karypis/publications.
11
12
13
14
15
16
 
17
18
 
19
 
20
 
21
J. Pei, J. Han, and R. Mao. Closet: An efficient algorithm for mining frequent closed itemsets. In Proc. of the DMKD'00, May 2000.
 
22
 
23
R. Srikant, Q. Vu, and R. Agrawal. Mining associations rules with item constraints. In Proc. of the ACM SIGKDD'97, August 1997.
24
 
25
 
26
K. Wang, Y. Jiang, J. Xu Yu, G. Dong, and J. Han. Pusing aggregate constraints by divide-and-approximate. In Proc. of the ICDE'03, March 2003.
 
27
J. Wang and G. Karypis. BAMBOO: Accelerating closed itemset mining by deeply pushing the length-decreasing support constraint. In Proc. of the SDM'04, April 2004.
28
 
29
M. Zaki and C. Hsiao. Charm: An efficient algorithm for closed itemset mining. In Proc. of the SDM'02, April 2002.

CITED BY  10

Collaborative Colleagues:
Krishna Gade: colleagues
Jianyong Wang: colleagues
George Karypis: colleagues