ACM Home Page
Please provide us with feedback. Feedback
Identifying non-actionable association rules
Full text PdfPdf (581 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
San Francisco, California
Pages: 329 - 334  
Year of Publication: 2001
ISBN:1-58113-391-X
Authors
Bing Liu  National University of Singapore, Singapore
Wynne Hsu  National University of Singapore, Singapore
Yiming Ma  National University of Singapore, Singapore
Sponsors
SIGMOD: ACM Special Interest Group on Management of Data
AAAI : American Association for Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 42,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/502512.502560
What is a DOI?

ABSTRACT

Building predictive models and finding useful rules are two important tasks of data mining. While building predictive models has been well studied, finding useful rules for action still presents a major problem. A main obstacle is that many data mining algorithms often produce too many rules. Existing research has shown that most of the discovered rules are actually redundant or insignificant. Pruning techniques have been developed to remove those spurious and/or insignificant rules. In this paper, we argue that being a significant rule (or a non-redundant rule), however, does not mean that it is a potentially useful rule for action. Many significant rules (unpruned rules) are in fact not actionable. This paper studies this issue and presents an efficient algorithm to identify these non-actionable rules. Experiment results on many real-life datasets show that the number of non-actionable rules is typically quite large. The proposed technique thus enables the user to focus on fewer rules and to be assured that the remaining rules are non-redundant and potentially useful for action.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
 
4
Bayardo, R., Agrawal, R, and Gunopulos, D. "Constraintbased rule mining in large, dense databases." ICDE-99, 1999.
5
 
6
Everitt, B. S. The analysis of contingency tables. Chapman and Hall, 1977.
 
7
Fayyad, U. M. and Irani, K. B. "Multi-interval discretization of continuous-valued attributes for classification learning." IJCAI-93, 1993.
 
8
9
 
10
Kohavi, R., John, G., Long, R., Manley, D., and Pfieger, K. "MLC++: a machine learning library in C++." Tools with "artificial intelligence, 740-743, 1994.
 
11
Liu, B., and Hsu, W. "Post-analysis of learned rules." AAAI- 96, 1996, pp. 828-834.
 
12
Liu, B., Hsu, W. and Ma, Y. "Integrating classification and association rule mining." KDD-98, 1998.
13
 
14
Liu, B., Hsu, W. and Ma, Y. Identifying non-actionable association rules. NUS Technical Report, 2001.
 
15
Merz, C. J, and Murphy, P. UCI repository of machine learning databases, 1996. {http://www.cs.uci.edu/~mlearn/MLRepository.html}
16
 
17
Piatesky-Shapiro, G., and Matheus, C."The interestingness of deviations." KDD-94, 1994.
 
18
19
 
20
 
21
Tan, P-N. & Kumar, V. "Interestingness measures for association patterns: a perspective." KDD-2000 Workshop on Post-processing in Machine Learning and Data Mining, 2000
22


Collaborative Colleagues:
Bing Liu: colleagues
Wynne Hsu: colleagues
Yiming Ma: colleagues