|
ABSTRACT
The paper presents a method for pruning frequent itemsets based on background knowledge represented by a Bayesian network. The interestingness of an itemset is defined as the absolute difference between its support estimated from data and from the Bayesian network. Efficient algorithms are presented for finding interestingness of a collection of frequent itemsets, and for finding all attribute sets with a given minimum interestingness. Practical usefulness of the algorithms and their efficiency have been verified experimentally.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Rakesh Agrawal , Tomasz Imieliński , Arun Swami, Mining association rules between sets of items in large databases, Proceedings of the 1993 ACM SIGMOD international conference on Management of data, p.207-216, May 25-28, 1993, Washington, D.C., United States
|
 |
2
|
Roberto J. Bayardo, Jr. , Rakesh Agrawal, Mining the most interesting rules, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.145-154, August 15-18, 1999, San Diego, California, United States
[doi> 10.1145/312129.312219]
|
| |
3
|
Susanne G. Bottcher and Claus Dethlefsen. Deal: A package for learning bayesian networks. www.math.auc.dk/novo/Publications/ bottcher:dethlefsen:03.ps, 2003.
|
| |
4
|
|
 |
5
|
|
 |
6
|
|
 |
7
|
Venky Harinarayan , Anand Rajaraman , Jeffrey D. Ullman, Implementing data cubes efficiently, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.205-216, June 04-06, 1996, Montreal, Quebec, Canada
|
| |
8
|
David Heckerman. A tutorial on learning with Bayesian networks. Technical Report MSR-TR-95-06, Microsoft Research, Redmond, WA, 1995.
|
| |
9
|
|
| |
10
|
C. Huang and A. Darwiche. Inference in belief networks: A procedural guide. Intl. Journal of Approximate Reasoning, 15(3):225--263, 1996.
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
Bing Liu, Wynne Hsu, and Shu Chen. Using general impressions to analyze discovered classification rules. In David Heckerman, Heikki Mannila, Daryl Pregibon, and Ramasamy Uthurusamy, editors, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining (KDD-97), page 31. AAAI Press, 1997.
|
 |
15
|
Bing Liu , Wynne Hsu , Yiming Ma , Shu Chen, Mining interesting knowledge using DM-II, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.430-434, August 15-18, 1999, San Diego, California, United States
[doi> 10.1145/312129.312311]
|
| |
16
|
|
| |
17
|
|
| |
18
|
|
| |
19
|
Kevin Murphy. A brief introduction to graphical models and bayesian networks. http://www.ai.mit.edu/~murphyk/Bayes/bnintro.html, 1998.
|
 |
20
|
|
| |
21
|
|
| |
22
|
P. Myllymäki, T. Silander, H. Tirri, and P.Uronen. B-course: A web-based tool for bayesian and causal data analysis. International Journal on Artificial Intelligence Tools, 11(3):369--387, 2002.
|
| |
23
|
D. Poole and N. L. Zhang. Exploiting contextual independence in probablisitic inference. Journal of Artificial Intelligence Research, 18:263--313, 2003.
|
| |
24
|
D. Shah, L. V. S. Lakshmanan, K. Ramamritham, and S. Sudarshan. Interestingness and pruning of mined patterns. In 1999 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 1999.
|
| |
25
|
Abraham Silberschatz and Alexander Tuzhilin. On subjective measures of interestingness in knowledge discovery. In Knowledge Discovery and Data Mining, pages 275--281, 1995.
|
| |
26
|
E. Suzuki. Autonomous discovery of reliable exception rules. In David Heckerman, Heikki Mannila, Daryl Pregibon, and Ramasamy Uthurusamy, editors, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining (KDD-97), page 259. AAAI Press, 1997.
|
| |
27
|
|
 |
28
|
|
 |
29
|
|
CITED BY 14
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Bing Liu , Kaidi Zhao , Jeffrey Benkler , Weimin Xiao, Rule interestingness analysis using OLAP operations, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, August 20-23, 2006, Philadelphia, PA, USA
|
|
|
Dong Xin , Hong Cheng , Xifeng Yan , Jiawei Han, Extracting redundancy-aware top-k patterns, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, August 20-23, 2006, Philadelphia, PA, USA
|
|
|
Kaidi Zhao , Bing Liu , Jeffrey Benkler , Weimin Xiao, Opportunity map: identifying causes of failure - a deployed data mining system, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, August 20-23, 2006, Philadelphia, PA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Yen-Ting Kuo , Andrew Lonie , Liz Sonenberg , Kathy Paizis, Domain ontology driven data mining: a medical case study, Proceedings of the 2007 international workshop on Domain driven data mining, p.11-17, August 12-12, 2007, San Jose, California
|
|
|
|
|
|
|
|