ACM Home Page
Please provide us with feedback. Feedback
Comparing association rules and decision trees for disease prediction
Full text PdfPdf (580 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the international workshop on Healthcare information and knowledge management table of contents
Arlington, Virginia, USA
SESSION: Medical data mining and applications table of contents
Pages: 17 - 24  
Year of Publication: 2006
ISBN:1-59593-528-2
Author
Carlos Ordonez  University of Houston, Houston, TX
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 18,   Downloads (12 Months): 257,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1183568.1183573
What is a DOI?

ABSTRACT

Association rules represent a promising technique to find hidden patterns in a medical data set. The main issue about mining association rules in a medical data set is the large number of rules that are discovered, most of which are irrelevant. Such number of rules makes search slow and interpretation by the domain expert difficult. In this work, search constraints are introduced to find only medically significant association rules and make search more efficient. In medical terms, association rules relate heart perfusion measurements and patient risk factors to the degree of stenosis in four specific arteries. Association rule medical significance is evaluated with the usual support and confidence metrics, but also lift. Association rules are compared to predictive rules mined with decision trees, a well-known machine learning technique. Decision trees are shown to be not as adequate for artery disease prediction as association rules. Experiments show decision trees tend to find few simple rules, most rules have somewhat low reliability, most attribute splits are different from medically common splits, and most rules refer to very small sets of patients. In contrast, association rules generally include simpler predictive rules, they work well with user-binned attributes, rule reliability is higher and rules generally refer to larger sets of patients.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
 
4
 
5
C. Becquet, S. Blachon, B. Jeudy, J.F. Boulicaut, and O. Gandrillon. Strong association-rule mining for large-scale gene-expression data analysis: a case study on human {SAGE data. Genom Biol., 3(12), 2002.
6
 
7
S.E. Brossette, A.P. Sprague, J.M. Hardin, K.B. Waites, W.T. Jones, and S.A. Moser. Association rules and data mining in hospital infection control and public health surveillance. J Am Med Inform Assoc. (JAMIA), 5(4):373--381, 1998.
 
8
S.E. Brossette, A.P. Sprague, W.T. Jones, and S.A. Moser. A data mining system for infection control surveillance. Methods Inf Med., 39(4):303--310, 2000.
 
9
 
10
T.J. Chen, L.F. Chou, and S.J. Hwang. Application of a data mining technique to analyze coprescription patterns for antacids in {Taiwan. Clin Ther, 25(9):2453--2463, 2003.
 
11
C. Creighton and S. Hanash. Mining gene expression databases for association rules. Bioinformatics, 19(1):79--86, 2003.
 
12
M. Delgado, D. Sanchez, M.J. Martin-Bautista, and M.A. Vila. Mining association rules with improved semantics in medical databases. Artificial Intelligence in Medicine, 21(1-3):241--5, 2001.
 
13
S.M. Down and M.Y. Wallace. Mining association rules from a pediatric primary care decision support system. In Proc of AMIA Symp., pages 200--204, 2000.
 
14
U. Fayyad and G. Piateski-Shapiro. From Data Mining to Knowledge Discovery. MIT Press, 1995.
 
15
H.S. Fraser, W.J. Long, and S. Naimi. Evaluation of a cardiac diagnostic program in a typical clinical setting. J Am Med Inform Assoc. (JAMIA), 10(4):373--381, 2003.
16
 
17
 
18
T. Hastie, R. Tibshirani, and J.H. Friedman. The Elements of Statistical Learning. Springer, New York, 1st edition, 2001.
 
19
20
 
21
 
22
 
23
W.J. Long, H.S. Fraser, and S. Naimi. Reasoning requirements for diagnosis of heart disease. Artificial Intelligence in Medicine, 10(1):5--24, 1997.
24
 
25
 
26
 
27
T. Oyama, K. Kitano, T. Satou, and T. Ito. Extraction of knowledge on protein-protein interaction by association rule discovery. Bioinformatics, 18(5):705--714, 2002.
 
28
29
 
30
31
 
32
R. Srikant, Q. Vu, and R. Agrawal. Mining association rules with item constraints. In ACM KDD Conference, pages 67--73, 1997.
 
33
 
34