|
ABSTRACT
In this paper, we propose OP-TKC (Order Preserving Top K Closed itemsets) algorithm for mining top-k frequent closed itemsets. Our methodology visits the closed itemsets lattice in breadth first manner and generates all the top-k closed itemsets without generating all the closed itemsets of a given dataset i.e. in the search space, only closed itemsets that belongs to top-k are expanded and all other closed itemsets are pruned off. Our algorithm computes all the top-k closed itemsets with O(D+ k) space complexity, where D is the dataset. Experiments involving publicly available datasets show that our algorithm takes less memory and running time than TFP algorithm.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
K. Makino and T. Uno, "New algorithms for enumerating all maximal cliques," in Proceedings of the 9th Scandinavian Workshop on Algorithm Theory (SWAT 2004), Springer-Verlag, 2004, pp. 260--272.
|
 |
11
|
|
 |
12
|
Rakesh Agrawal , Tomasz Imieliński , Arun Swami, Mining association rules between sets of items in large databases, Proceedings of the 1993 ACM SIGMOD international conference on Management of data, p.207-216, May 25-28, 1993, Washington, D.C., United States
[doi> 10.1145/170035.170072]
|
| |
13
|
T. Uno, M. Kiyomi, and H. Arimura, "LCM ver. 2: Efficient mining algorithms for Frequent/closed/maximal itemsets," In Proc. IEEE ICDM'04 Workshop FIMI'04, 2004.
|
| |
14
|
|
| |
15
|
C. Lucchese, S. Orlando, P. Palmerini, R. Perego, and F. Silvestri, "KDCI: A Multi-Strategy Algorithm for Mining Frequent Sets," Proc. IEEE ICDM FIMI'03 Workshop, Dec 2003
|
| |
16
|
|
| |
17
|
|
| |
18
|
|
 |
19
|
Jiawei Han , Jian Pei , Yiwen Yin, Mining frequent patterns without candidate generation, Proceedings of the 2000 ACM SIGMOD international conference on Management of data, p.1-12, May 15-18, 2000, Dallas, Texas, United States
[doi> 10.1145/342009.335372]
|
 |
20
|
|
| |
21
|
Ferenc Bodon, "A Fast APRIORI Implementation," IEEE ICDM FIMI'03, USA, 2003.
|
| |
22
|
M. J. Zaki, S. Parthasarathy, M. Ogihara and W. Li, "New Algorithms for fast Discovery of Association Rules", Proc. 3rd Intl. Conf. Knowledge Discovery and Data Mining, pages: 283--286, 1997
|
| |
23
|
I. Rigoutsos and A. Floratos, "Combinatorial Pattern Discovery in Biological Sequences: The Teiresias Algorithm, Bioinformatics, Vol 14, pages 55--67, 1998.
|
 |
24
|
|
| |
25
|
|
| |
26
|
C. Lucchese, S. Orlando, P. Palmerini, R. Perego and F. Silvestri, "KDCI: A multistrategy Algorithm for Mining Frequent Sets," In Proc. IEEE ICDM'03 Workshop FIMI'03, 2003.
|
| |
27
|
|
 |
28
|
Feng Pan , Gao Cong , Anthony K. H. Tung , Jiong Yang , Mohammed J. Zaki, Carpenter: finding closed patterns in long biological datasets, Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2003, Washington, D.C.
[doi> 10.1145/956750.956832]
|
|