ACM Home Page
Please provide us with feedback. Feedback
Boosting for transfer learning
Full text PdfPdf (475 KB)
Source ICML; Vol. 227 archive
Proceedings of the 24th international conference on Machine learning table of contents
Corvalis, Oregon
Pages: 193 - 200  
Year of Publication: 2007
ISBN:978-1-59593-793-3
Authors
Wenyuan Dai  Shanghai Jiao Tong University, China
Qiang Yang  Hong Kong University of Science and Technology, Hong Kong
Gui-Rong Xue  Shanghai Jiao Tong University, China
Yong Yu  Shanghai Jiao Tong University, China
Sponsor
: Machine Learning Journal
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 18,   Downloads (12 Months): 123,   Citation Count: 9
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1273496.1273521
What is a DOI?

ABSTRACT

Traditional machine learning makes a basic assumption: the training and test data should be under the same distribution. However, in many cases, this identical-distribution assumption does not hold. The assumption might be violated when a task from one new domain comes, while there are only labeled data from a similar old domain. Labeling the new data can be costly and it would also be a waste to throw away all the old data. In this paper, we present a novel transfer learning framework called TrAdaBoost, which extends boosting-based learning algorithms (Freund & Schapire, 1997). TrAdaBoost allows users to utilize a small amount of newly labeled data to leverage the old data to construct a high-quality classification model for the new data. We show that this method can allow us to learn an accurate model using only a tiny amount of new data and a large amount of old data, even when the new data are not sufficient to train a model alone. We show that TrAdaBoost allows knowledge to be effectively transferred from the old data to the new. The effectiveness of our algorithm is analyzed theoretically and empirically to show that our iterative algorithm can converge well to an accurate model.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Ben-David, S., & Schuller, R. (2003). Exploiting task relatedness for multiple task learning. Proceedings of the Sixteenth Annual Conference on Learning Theory.
 
2
Bickel, S., & Scheffer, T. (2007). Dirichlet-enhanced spam filtering based on biased samples. In Advances in neural information processing systems 19.
3
 
4
 
5
DauméIII, H., & Marcu, D. (2006). Domain adaptation for statistical classifiers. Journal of Artificial Intelligence Research, 26, 101--126.
 
6
Dudík, M., Schapire, R., & Phillips, S. (2006). Correcting sample selection bias in maximum entropy density estimation. In Advances in neural information processing systems 18.
 
7
 
8
Heckman, J. J. (1979). Sample selection bias as a specification error. Econometrica, 47, 153--161.
 
9
Huang, J., Smola, A., Gretton, A., Borgwardt, K. M., & Schöölkopf, B. (2007). Correcting sample selection bias by unlabeled data. In Advances in neural information processing systems 19.
 
10
 
11
Joachims, T. (2002). Learning to classify text using support vector machines. Dissertation, Kluwer.
 
12
Kullback, S., & Leibler, R. A. (1951). On information and sufficiency. Annals of Mathematical Statistics, 22(1), 79--86.
13
 
14
Rosenstein, M. T., Marx, Z., Kaelbling, L. P., & Dietterich, T. G. (2005). To transfer or not to transfer. Proceedings of NIPS 2005 Workshop on Inductive Transfer: 10 Years Later.
 
15
 
16
 
17
Schmidhuber, J. (1994). On learning how to learn learning strategies (Technical Report FKI-198-94). Fakultat fur Informatik.
 
18
Shimodaira, H. (2000). Improving predictive inference under covariate shift by weighting the log-likelihood function. Journal of Statistical Planning and Inference, 90, 227--244.
 
19
Thrun, S., & Mitchell, T. M. (1995). Learning one more thing. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence.
20
21

CITED BY  9
Collaborative Colleagues:
Wenyuan Dai: colleagues
Qiang Yang: colleagues
Gui-Rong Xue: colleagues
Yong Yu: colleagues