ABSTRACT
Association rules are a promising technique for finding hidden patterns in a medical data set. The main issue in mining association rules from a medical data set is the large number of rules discovered, most of which are irrelevant. Such a large number of rules makes search slow and interpretation by the domain expert difficult. In this work, search constraints are introduced to find only medically significant association rules and to make search more efficient. In medical terms, association rules relate heart perfusion measurements and patient risk factors to the degree of stenosis in four specific arteries. The medical significance of association rules is evaluated with the usual support and confidence metrics, and also with lift. Association rules are compared to predictive rules mined with decision trees, a well-known machine learning technique. Decision trees are shown to be less adequate for artery disease prediction than association rules. Experiments show that decision trees tend to find few simple rules, most rules have somewhat low reliability, most attribute splits differ from medically common splits, and most rules refer to very small sets of patients. In contrast, association rules generally include simpler predictive rules, work well with user-binned attributes, have higher reliability, and generally refer to larger sets of patients.
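As an illustration of the three metrics named in the abstract, the following minimal sketch computes support, confidence, and lift for a single rule using their standard textbook definitions. The patient records, the item names (such as `age>=60` and `stenosis>=70`), and the function names are hypothetical and chosen only for illustration; they are not taken from the paper's data set or implementation.

```python
from typing import FrozenSet, List

Record = FrozenSet[str]


def support(records: List[Record], itemset: Record) -> float:
    """Fraction of records that contain every item in `itemset`."""
    if not records:
        return 0.0
    hits = sum(1 for r in records if itemset <= r)
    return hits / len(records)


def confidence(records: List[Record], antecedent: Record, consequent: Record) -> float:
    """support(antecedent and consequent) / support(antecedent)."""
    s_a = support(records, antecedent)
    if s_a == 0.0:
        return 0.0
    return support(records, antecedent | consequent) / s_a


def lift(records: List[Record], antecedent: Record, consequent: Record) -> float:
    """confidence(antecedent => consequent) / support(consequent)."""
    s_c = support(records, consequent)
    if s_c == 0.0:
        return 0.0
    return confidence(records, antecedent, consequent) / s_c


# Hypothetical toy records; each record is the set of binned items for one patient.
records = [
    frozenset({"age>=60", "smoker", "stenosis>=70"}),
    frozenset({"age>=60", "stenosis>=70"}),
    frozenset({"smoker"}),
    frozenset({"age>=60", "smoker"}),
]

antecedent = frozenset({"age>=60"})
consequent = frozenset({"stenosis>=70"})

rule_support = support(records, antecedent | consequent)
rule_confidence = confidence(records, antecedent, consequent)
rule_lift = lift(records, antecedent, consequent)

print(f"support={rule_support:.3f} confidence={rule_confidence:.3f} lift={rule_lift:.3f}")
```

A lift above 1 indicates that the antecedent and consequent occur together more often than would be expected if they were independent, which is why lift is a useful complement to support and confidence when filtering rules.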