| Frequent pattern mining with uncertain data |
| Full text |
Pdf
(461 KB)
|
Source
|
International Conference on Knowledge Discovery and Data Mining
archive
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
table of contents
Paris, France
SESSION: Research track papers
table of contents
Pages 29-38
Year of Publication: 2009
ISBN:978-1-60558-495-9
|
|
Authors
|
|
Charu C. Aggarwal
|
IBM T. J. Watson Research Ctr, Hawthorne, NY, USA
|
|
Yan Li
|
Tsinghua University, Beijing, China
|
|
Jianyong Wang
|
Tsinghua University, Beijing, China
|
|
Jing Wang
|
New York University, New York, NY, USA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 118, Downloads (12 Months): 380, Citation Count: 0
|
|
|
ABSTRACT
This paper studies the problem of frequent pattern mining with uncertain data. We will show how broad classes of algorithms can be extended to the uncertain data setting. In particular, we will study candidate generate-and-test algorithms, hyper-structure algorithms and pattern growth based algorithms. One of our insightful observations is that the experimental behavior of different classes of algorithms is very different in the uncertain case as compared to the deterministic case. In particular, the hyper-structure and the candidate generate-and-test algorithms perform much better than tree-based algorithms. This counter-intuitive behavior is an important observation from the perspective of algorithm design of the uncertain variation of the problem. We will test the approach on a number of real and synthetic data sets, and show the effectiveness of two of our approaches over competitive techniques.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
 |
4
|
|
| |
5
|
F. Bodon. A fast APRIORI implementation. URL: {http://fimi.cs.helsinki.fi/src/}.
|
| |
6
|
|
| |
7
|
C.-K. Chui, B. Kao, E. Hung. Mining Frequent Itemsets from Uncertain Data. PAKDD 2007.
|
| |
8
|
C.-K. Chui, B. Kao. Decremental Approach for Mining Frequent Itemsets from Uncertain Data. PAKDD 2008.
|
 |
9
|
Jiawei Han , Jian Pei , Yiwen Yin, Mining frequent patterns without candidate generation, Proceedings of the 2000 ACM SIGMOD international conference on Management of data, p.1-12, May 15-18, 2000, Dallas, Texas, United States
|
 |
10
|
|
| |
11
|
C. K.-S. Leung, M. A. F. Mateo, D. A. Brajczuk. A Tree-Based Approach for Frequent Pattern Mining from UncertainData, PAKDD 2008.
|
| |
12
|
Jian Pei , Jiawei Han , Hongjun Lu , Shojiro Nishio , Shiwei Tang , Dongqing Yang, H-Mine: Hyper-Structure Mining of Frequent Patterns in Large Databases, Proceedings of the 2001 IEEE International Conference on Data Mining, p.441-448, November 29-December 02, 2001
|
| |
13
|
Y.G. Sucahyo, R.P. Gopalan. CT-PRO: A Bottom-up Non-Recursive Frequent Itemset Mining Algorithm Using Compressed FP-Tree DataStructure. URL: {http://fimi.cs.helsinki.fi/src/}.
|
 |
14
|
|
|