| LEGAL-tree: a lexicographic multi-objective genetic algorithm for decision tree induction |
| Full text |
Pdf
(286 KB)
|
Source
|
Symposium on Applied Computing
archive
Proceedings of the 2009 ACM symposium on Applied Computing
table of contents
Honolulu, Hawaii
SESSION: Applications of evolutionary computation track
table of contents
Pages 1085-1090
Year of Publication: 2009
ISBN:978-1-60558-166-8
|
|
Authors
|
|
Márcio P. Basgalupp
|
University of São Paulo, São Carlos -- SP, Brazil
|
|
Rodrigo C. Barros
|
Pontifical Catholic University of RS, Porto Alegre -- RS, Brazil
|
|
André C. P. L. F. de Carvalho
|
University of São Paulo, São Carlos -- SP, Brazil
|
|
Alex A. Freitas
|
University of Kent, Canterbury, Kent, United Kingdom
|
|
Duncan D. Ruiz
|
Pontifical Catholic University of RS, Porto Alegre -- RS, Brazil
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 8, Downloads (12 Months): 40, Citation Count: 0
|
|
|
ABSTRACT
Decision trees are widely disseminated as an effective solution for classification tasks. Decision tree induction algorithms have some limitations though, due to the typical strategy they implement: recursive top-down partitioning through a greedy split evaluation. This strategy is limiting in the sense that there is quality loss while the partitioning process occurs, creating statistically insignificant rules. In order to prevent the greedy strategy and to avoid converging to local optima, we present a novel Genetic Algorithm for decision tree induction based on a lexicographic multi-objective approach, and we compare it with the most well-known algorithm for decision tree induction, J48, over distinct public datasets. The results show the feasibility of using this technique as a means to avoid the previously described problems, reporting not only a comparable accuracy but also, importantly, a significantly simpler classification model in the employed datasets.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Breiman, L., Friedman, J., Olshen, R., Stone, C. 1984 Classification and Regression Trees. Wadsworth.
|
| |
3
|
Buntine, W. 1993. Learning Classification Trees. Statistics and Computing, v. 2 (2), 63--73.
|
| |
4
|
Coello, C., Veldhuizen, D., Lamont, G. 2002 Evolutionary Algorithms for Solving Multi-Objective Problems. Springer.
|
| |
5
|
|
| |
6
|
Jesús K. Estrada-Gil , Juan C. Fernández-López , Enrique Hernández-Lemus , Irma Silva-Zolezzi , Alfredo Hidalgo-Miranda , Gerardo Jiménez-Sánchez , Edgar E. Vallejo-Clemente, GPDTI, Bioinformatics, v.23 n.13, p.i167-i174, July 2007
[doi> 10.1093/bioinformatics/btm205]
|
| |
7
|
Freitas, Alex A. 2007. A Review of Evolutionary Algorithms for Data Mining. Soft Computing for Knowledge Discovery and Data Mining. Springer-Verlag New York, Inc., 79--111.
|
 |
8
|
|
| |
9
|
Freitas, Alex A., Wieser, Daniela C., Apweiler, Rolf. 2008. On the importance of comprehensible classification models for protein function prediction. To appear in IEEE/ACM Transactions on Computational Biology and Bioinformatics.
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
Loveard, T., Ciesielski, V. 2001. Representing classification problems in genetic programming. Proceedings of the 2001 Congress on Evolutionary Computation, 1070--1077.
|
| |
14
|
Loveard, T., Ciesielski, V. 2002. Employing nominal attributes in classification using genetic programming. Proceedings of the Fourth Asia-Pacific Conference on Simulated Evolution and Learning, 487--491.
|
| |
15
|
|
| |
16
|
|
| |
17
|
Newman, D. J., Hettich, S., Blake, C. L., Merz, C. J. 1998. UCI Repository of Machine Learning Databases, http://www.ics.uci.edu/~mlearn/MLRepository.html.
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
Quinlan, J. R., 1996. Bagging, boosting, and C4.5. In Proceedings of the Fourteenth National Conference on Artificial Intelligence.
|
| |
22
|
Shirasaka, M., Zhao, Q., Hammarmi, O., Kuroda, K., Saito, K. 1998. Automatic design of binary decision trees based on genetic programming. Proceedings of the Second Asia-Pacific Conference on Simulated Evolution and Learning.
|
| |
23
|
Tür, G., Güvenir, H. A. 1996. Decision tree induction using genetic programming. Proceedings of the Fifth Turkish Symposium on Artificial Intelligence and Neural Networks.
|
| |
24
|
|
| |
25
|
|
| |
26
|
Zhao, Q., Shirasaka, M. 1999. A study on evolutionary design of binary decision trees, Proceedings of the 1999 Congress on Evolutionary Computation, 1988--1993.
|
|