|
ABSTRACT
Building predictive models and finding useful rules are two important tasks of data mining. While building predictive models has been well studied, finding useful rules for action still presents a major problem. A main obstacle is that many data mining algorithms often produce too many rules. Existing research has shown that most of the discovered rules are actually redundant or insignificant. Pruning techniques have been developed to remove those spurious and/or insignificant rules. In this paper, we argue that being a significant rule (or a non-redundant rule), however, does not mean that it is a potentially useful rule for action. Many significant rules (unpruned rules) are in fact not actionable. This paper studies this issue and presents an efficient algorithm to identify these non-actionable rules. Experiment results on many real-life datasets show that the number of non-actionable rules is typically quite large. The proposed technique thus enables the user to focus on fewer rules and to be assured that the remaining rules are non-redundant and potentially useful for action.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
Bayardo, R., Agrawal, R, and Gunopulos, D. "Constraintbased rule mining in large, dense databases." ICDE-99, 1999.
|
 |
5
|
Roberto J. Bayardo, Jr. , Rakesh Agrawal, Mining the most interesting rules, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.145-154, August 15-18, 1999, San Diego, California, United States
[doi> 10.1145/312129.312219]
|
| |
6
|
Everitt, B. S. The analysis of contingency tables. Chapman and Hall, 1977.
|
| |
7
|
Fayyad, U. M. and Irani, K. B. "Multi-interval discretization of continuous-valued attributes for classification learning." IJCAI-93, 1993.
|
| |
8
|
|
 |
9
|
Mika Klemettinen , Heikki Mannila , Pirjo Ronkainen , Hannu Toivonen , A. Inkeri Verkamo, Finding interesting rules from large sets of discovered association rules, Proceedings of the third international conference on Information and knowledge management, p.401-407, November 29-December 02, 1994, Gaithersburg, Maryland, United States
[doi> 10.1145/191246.191314]
|
| |
10
|
Kohavi, R., John, G., Long, R., Manley, D., and Pfieger, K. "MLC++: a machine learning library in C++." Tools with "artificial intelligence, 740-743, 1994.
|
| |
11
|
Liu, B., and Hsu, W. "Post-analysis of learned rules." AAAI- 96, 1996, pp. 828-834.
|
| |
12
|
Liu, B., Hsu, W. and Ma, Y. "Integrating classification and association rule mining." KDD-98, 1998.
|
 |
13
|
Bing Liu , Wynne Hsu , Yiming Ma, Pruning and summarizing the discovered associations, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.125-134, August 15-18, 1999, San Diego, California, United States
[doi> 10.1145/312129.312216]
|
| |
14
|
Liu, B., Hsu, W. and Ma, Y. Identifying non-actionable association rules. NUS Technical Report, 2001.
|
| |
15
|
Merz, C. J, and Murphy, P. UCI repository of machine learning databases, 1996. {http://www.cs.uci.edu/~mlearn/MLRepository.html}
|
 |
16
|
|
| |
17
|
Piatesky-Shapiro, G., and Matheus, C."The interestingness of deviations." KDD-94, 1994.
|
| |
18
|
|
 |
19
|
|
| |
20
|
|
| |
21
|
Tan, P-N. & Kumar, V. "Interestingness measures for association patterns: a perspective." KDD-2000 Workshop on Post-processing in Machine Learning and Data Mining, 2000
|
 |
22
|
|
CITED BY 5
|
|
|
|
|
|
|
|
|
|
|
Miho Ohsaki , Hidenao Abe , Shusaku Tsumoto , Hideto Yokoi , Takahira Yamaguchi, Evaluation of rule interestingness measures in medical knowledge discovery in databases, Arificial Intelligence in Medicine, v.41 n.3, p.177-196, November, 2007
|
|
|
|
|