ACM Home Page
Please provide us with feedback. Feedback
Towards efficient mining of proportional fault-tolerant frequent itemsets
Full text PdfPdf (710 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Paris, France
SESSION: Research track papers table of contents
Pages 697-706  
Year of Publication: 2009
ISBN:978-1-60558-495-9
Authors
Ardian Kristanto Poernomo  Nanyang Technological University, Singapore, Singapore
Vivekanand Gopalkrishnan  Nanyang Technological University, Singapore, Singapore
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 36,   Downloads (12 Months): 117,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1557019.1557097
What is a DOI?

ABSTRACT

Fault-tolerant frequent itemsets (FTFI) are variants of frequent itemsets for representing and discovering generalized knowledge. However, despite growing interest in this field, no previous approach mines proportional FTFIs with their exact support (FT-support).

This problem is difficult because of two concerns: (a) non anti-monotonic property of FT-support when relaxation is proportional, and (b) difficulty in computing FT-support. Previous efforts on this problem either simplify the general problem by adding constraints, or provide approximate solutions without any error guarantees.

In this paper, we address these concerns in the general FTFI mining problem. We limit the search space by providing provably correct anti monotone bounds for FT-support and develop practically efficient means of achieving them. Besides, we also provide an efficient and exact FT-support counting procedure.

Extensive experiments using real datasets validate that our solution is reasonably efficient for completely mining FTFIs. Implementations for the algorithms are available from www.cais.ntu.edu.sg/~vivek/pubs/ftfim09.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
J. Besson, R. G. Pensa, C. Robardet, and J.-F. Boulicaut. Constraint-based mining of fault-tolerant patterns from boolean data. In KDID, pages 55--71, 2005.
3
 
4
T. Calders and B. Goethals. Depth-first non-derivable itemset mining. In SDM, 2005.
 
5
T. Calders and B. Goethals. Quick inclusion-exclusion. In KDID, pages 86--103, 2005.
 
6
H. Cheng, P. S. Yu, and J. Han. Approximate frequent itemset mining in the presence of random noise. In Soft Computing for Knowledge Discovery and Data Mining, pages 363--389, 2007.
7
 
8
J. Li, K. Sim, G. Liu, and L. Wong. Maximal quasi-bicliques with balanced noise tolerance: Concepts and co-clustering applications. In SDM, pages 72--83, 2008.
 
9
 
10
J. Pei, A. K. H. Tung, and J. Han. Fault-tolerant frequent pattern mining: Problems and challenges. In DMKD, 2001.
 
11
 
12
A. K. Poernomo and V. Gopalkrishnan. Efficient computation of partial-support for mining interesting itemsets. In SDM, 2009.
 
13
J. K. Seppanen. Using and Extending Itemsets in Data Mining: Query Approximation, Dense Itemsets, and Tiles. PhD thesis, Helsinki University of Technology, May 2006.
14
 
15
16
 
17
M. J. Zaki, S. Parthasarathy, M. Ogihara, and W. Li. New algorithms for fast discovery of association rules. In KDD, pages 283--286, 1997.

Collaborative Colleagues:
Ardian Kristanto Poernomo: colleagues
Vivekanand Gopalkrishnan: colleagues