| Solving regression problems with rule-based ensemble classifiers |
| Full text |
Pdf
(557 KB)
|
| Source
|
International Conference on Knowledge Discovery and Data Mining
archive
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
table of contents
San Francisco, California
Pages: 287 - 292
Year of Publication: 2001
ISBN:1-58113-391-X
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 7, Downloads (12 Months): 35, Citation Count: 4
|
|
|
ABSTRACT
We describe a lightweight learning method that induces an ensemble of decision-rule solutions for regression problems. Instead of direct prediction of a continuous output variable, the method discretizes the variable by k-means clustering and solves the resultant classification problem. Predictions on new examples are made by averaging the mean values of classes with votes that are close in number to the most likely class. We provide experimental evidence that this indirect approach can often yield strong results for many applications, generally outperforming direct approaches such as regression trees and rivaling bagged regression trees.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
L. Breiman, J. Friedman, R. Olshen, and C. Stone. Classification and Regression Trees. Wadsworth, Monterrey, CA., 1984.
|
| |
4
|
R. Camacho. Inducing Models of Human Control Skills using Machine Learning Algorithms. Ph.d. thesis, University of Porto, 2000.
|
| |
5
|
William W. Cohen , Yoram Singer, A simple, fast, and effective rule learner, Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence, p.335-342, July 18-22, 1999, Orlando, Florida, United States
|
| |
6
|
J. Dougherty, R. Kohavi, and M. Sahami. Supervised and unsupervised discretization of continuous features. In Proceedings of the 12th Int'l Conference on Machine Learning, pages 194-202, 1995.
|
| |
7
|
U. Fayyad and K. Irani. Multi-interval discretization of continuous-valued attributes for classification learning. In Proceedings of the 13th Int'l Joint Conference on Artificial Intelligence, pages 1022-1027, 1993.
|
| |
8
|
J. Friedman. Multivariate adaptive regression splines. Annals of Statistics, 19(1):1-141, 1991.
|
| |
9
|
J. Friedman, T. Hastie, and R. Tibshirani. Additive logistic regression: A statistical view of boosting. Technical report, Stanford University Statistics Department, 1998. www.stat-stanford.edu/~tibs.
|
| |
10
|
J. Hartigan and M. Wong. A k-means clustering algorithm, ALGORITHM AS 136. Applied Statistics, 28(1), 1979.
|
| |
11
|
|
| |
12
|
L. Torgo and J. Gama. Regression using classification algorithms. Intelligent Data Analysis, 1(4), 1997.
|
| |
13
|
S. Weiss and N. Indurkhya. Rule-based machine learning methods for functional prediction. Journal of Artificial Intelligence Research, 3:383-403, 1995.
|
| |
14
|
|
CITED BY 4
|
|
C. V. Apte , S. J. Hong , R. Natarajan , E. P. D. Pednault , F. A. Tipu , S. M. Weiss, Data-intensive analytics for predictive modeling, IBM Journal of Research and Development, v.47 n.1, p.17-23, January 2003
|
|
|
|
|
|
|
|
|
Ruben Nicolas , Elisabet Golobardes , Albert Fornells , Sonia Segura , Susana Puig , Cristina Carrera , Joseph Palou , Josep Malvehy, Using Ensemble-Based Reasoning to Help Experts in Melanoma Diagnosis, Proceeding of the 2008 conference on Artificial Intelligence Research and Development: Proceedings of the 11th International Conference of the Catalan Association for Artificial Intelligence, p.178-185, July 03, 2008
|
|