|
ABSTRACT
Past empirical work has shown that learning multiple related tasks from data simultaneously can be advantageous in terms of predictive performance relative to learning these tasks independently. In this paper we present an approach to multi--task learning based on the minimization of regularization functionals similar to existing ones, such as the one for Support Vector Machines (SVMs), that have been successfully used in the past for single--task learning. Our approach allows to model the relation between tasks in terms of a novel kernel function that uses a task--coupling parameter. We implement an instance of the proposed approach similar to SVMs and test it empirically using simulated as well as real data. The experimental results show that the proposed method performs better than existing multi--task learning methods and largely outperforms single--task learning using SVMs.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
G.M. Allenby and P.E. Rossi. Marketing Models of Consumer Heterogeneity. Journal of Econometrics, 89, p. 57--78, 1999.
|
| |
2
|
|
| |
3
|
N. Arora and J. Huber. Improving parameter estimates and model prediction by aggregate customization in choice experiments. Journal of Consumer Research, Vol. 28, September 2001.
|
| |
4
|
|
| |
5
|
|
| |
6
|
J. Baxter. A Model for Inductive Bias Learning. Journal of Artificial Intelligence Research, 12, p. 149--198, 2000.
|
 |
7
|
|
| |
8
|
S. Ben-David and R. Schuller. Exploiting Task Relatedness for Multiple Task Learning. Proceedings of Computational Learning Theory (COLT), 2003.
|
| |
9
|
L. Breiman and J.H Friedman. Predicting Multivariate Responses in Multiple Linear Regression. Royal Statistical Society Series B, 1998.
|
| |
10
|
P.J. Brown and J.V. Zidek. Adaptive Multivariate Ridge Regression. The Annals of Statistics, Vol. 8, No. 1, p. 64--74, 1980.
|
| |
11
|
|
| |
12
|
T. Evgeniou, C. Boussios, and G. Zacharia. Generalized Robust Conjoint Estimation. INSEAD Working Paper, 2002.
|
| |
13
|
T. Evgeniou, M. Pontil, and T. Poggio. Regularization networks and support vector machines. Advances in Computational Mathematics, 13:1--50, 2000.
|
| |
14
|
B. Heisele, T. Serre, M. Pontil, T. Vetter, and T. Poggio. Categorization by Learning and Combining Object Parts. In: Advances in Neural Information Processing Systems 14, Vancouver, Canada, Vol. 2, 1239--1245, 2002.
|
| |
15
|
|
| |
16
|
|
| |
17
|
|
| |
18
|
G.R.G. Lanckriet, T. De Bie, N. Cristianini, M.I. Jordan, and W.S. Noble. A framework for genomic data fusion and its application to membrane protein prediction. Technical Report CSD--03--1273, Division of Computer Science, University of California, Berkeley, 2003.
|
| |
19
|
O.L. Mangasarian. Nonlinear Programming. Classics in Applied Mathematics. SIAM, 1994.
|
| |
20
|
C.A. Micchelli and M. Pontil. On Learning Vector--Valued Functions. Research Note RN/03/08, Dept of Computer Science, UCL, July 2003.
|
| |
21
|
D.L. Silver and R.E Mercer. The parallel transfer of task knowledge using dynamic learning rates based on a measure of relatedness. Connection Science, 8, p. 277--294, 1996.
|
| |
22
|
|
| |
23
|
|
| |
24
|
O. Toubia, D.I. Simester, J.R. Hauser, and E. Dahan. Fast Polyhedral Adaptive Conjoint Estimation. Working paper, MIT Sloan School of Management, 2001.
|
| |
25
|
|
| |
26
|
G. Wahba. Splines Models for Observational Data. Series in Applied Mathematics, Vol. 59, SIAM, Philadelphia, 1990.
|
CITED BY 15
|
|
|
|
|
|
|
|
Shipeng Yu , Kai Yu , Volker Tresp , Hans-Peter Kriegel , Mingrui Wu, Supervised probabilistic principal component analysis, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, August 20-23, 2006, Philadelphia, PA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Steffen Bickel , Jasmina Bogojeska , Thomas Lengauer , Tobias Scheffer, Multi-task learning for HIV therapy screening, Proceedings of the 25th international conference on Machine learning, p.56-63, July 05-09, 2008, Helsinki, Finland
|
|
|
|
|
|
|
|
|
Shuiwang Ji , Lei Yuan , Ying-Xin Li , Zhi-Hua Zhou , Sudhir Kumar , Jieping Ye, Drosophila gene expression pattern annotation using sparse features and term-term interactions, Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, June 28-July 01, 2009, Paris, France
|
|
|
|
|
|
|
|
|
Vincent Wenchen Zheng , Sinno Jialin Pan , Qiang Yang , Jeffrey Junfeng Pan, Transferring multi-device localization models using latent multi-task learning, Proceedings of the 23rd national conference on Artificial intelligence, p.1427-1432, July 13-17, 2008, Chicago, Illinois
|
|
|
|
|