|
ABSTRACT
Cost-sensitive learning addresses the issue of classification in the presence of varying costs associated with different types of misclassification. In this paper, we present a method for solving multi-class cost-sensitive learning problems using any binary classification algorithm. This algorithm is derived using hree key ideas: 1) iterative weighting; 2) expanding data space; and 3) gradient boosting with stochastic ensembles. We establish some theoretical guarantees concerning the performance of this method. In particular, we show that a certain variant possesses the boosting property, given a form of weak learning assumption on the component binary classifier. We also empirically evaluate the performance of the proposed method using benchmark data sets and verify that our method generally achieves better results than representative methods for cost-sensitive learning, in terms of predictive performance (cost minimization) and, in many cases, computational efficiency.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
S. D. Bay. UCI KDD archive. Department of Information and Computer Sciences, University of California, Irvine, 2000. http://kdd.ics.uci.edu/.
|
| |
3
|
C. L. Blake and C. J. Merz. UCI repository of machine learning databases. Department of Information and Computer Sciences, University of California, Irvine, 1998. http://www.ics.uci.edu/~mlearn/MLRepository.html.
|
| |
4
|
|
| |
5
|
|
| |
6
|
L. Breiman, J. H. Friedman, R. A. Olsen, and C. J. Stone. Classification and Regression Trees. Wadsworth International Group, 1984.
|
| |
7
|
P. Chan and S. Stolfo. Toward scalable learning with non-uniform class and cost distributions. In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pages 164--168, 1998.
|
 |
8
|
|
| |
9
|
C. Drummond and R. C. Holte. C4.5, class imbalance, and cost-sensitivity: Why under-sampling beats over-sampling. In Workshop Notes, Workshop on Cost-Sensitive Learning, International Conference on Machine Learning, June 2000.
|
 |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
G. Fumera and F. Roli. Cost-sensitive learning in support vector machines. In VIII Convegno Associazione Italiana per L'Intelligenza Artificiale, 2002.
|
| |
14
|
P. Geibel and F. Wysotzki. Perceptron based learning with example dependent and noisy costs. In Proceedings of the Twentieth International Conference on Machine Learning, 2003.
|
| |
15
|
|
| |
16
|
|
| |
17
|
L. Mason, J. Baxter, P. Barlett, and M. Frean. Boosting algorithms as gradient descent. In Advances in Neural Information Processing systems 12, pages 512--158, 2000.
|
| |
18
|
|
 |
19
|
|
| |
20
|
|
CITED BY 15
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Taichi Noro , Takashi Inui , Hiroya Takamura , Manabu Okumura, Time period identification of events in text, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, p.1153-1160, July 17-18, 2006, Sydney, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Fen Xia , Yan-wu Yang , Liang Zhou , Fuxin Li , Min Cai , Daniel D. Zeng, A closed-form reduction of multi-class cost-sensitive learning to weighted multi-class learning, Pattern Recognition, v.42 n.7, p.1572-1581, July, 2009
|
|
|
|
|
|
|
|
|
|
|
|
Xingquan Zhu , Xindong Wu , Taghi M. Khoshgoftaar , Yong Shi, An empirical study of the noise impact on cost-sensitive learning, Proceedings of the 20th international joint conference on Artifical intelligence, p.1168-1173, January 06-12, 2007, Hyderabad, India
|
|
|
|
|