ACM Home Page
Please provide us with feedback. Feedback
Regularized multi--task learning
Full text PdfPdf (218 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Seattle, WA, USA
SESSION: Research track papers table of contents
Pages: 109 - 117  
Year of Publication: 2004
ISBN:1-58113-888-1
Authors
Theodoros Evgeniou  INSEAD, Fontainebleau, France
Massimiliano Pontil  University College London, London, UK
Sponsors
SIGMOD: ACM Special Interest Group on Management of Data
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 20,   Downloads (12 Months): 137,   Citation Count: 15
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1014052.1014067
What is a DOI?

ABSTRACT

Past empirical work has shown that learning multiple related tasks from data simultaneously can be advantageous in terms of predictive performance relative to learning these tasks independently. In this paper we present an approach to multi--task learning based on the minimization of regularization functionals similar to existing ones, such as the one for Support Vector Machines (SVMs), that have been successfully used in the past for single--task learning. Our approach allows to model the relation between tasks in terms of a novel kernel function that uses a task--coupling parameter. We implement an instance of the proposed approach similar to SVMs and test it empirically using simulated as well as real data. The experimental results show that the proposed method performs better than existing multi--task learning methods and largely outperforms single--task learning using SVMs.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
G.M. Allenby and P.E. Rossi. Marketing Models of Consumer Heterogeneity. Journal of Econometrics, 89, p. 57--78, 1999.
 
2
 
3
N. Arora and J. Huber. Improving parameter estimates and model prediction by aggregate customization in choice experiments. Journal of Consumer Research, Vol. 28, September 2001.
 
4
 
5
 
6
J. Baxter. A Model for Inductive Bias Learning. Journal of Artificial Intelligence Research, 12, p. 149--198, 2000.
7
 
8
S. Ben-David and R. Schuller. Exploiting Task Relatedness for Multiple Task Learning. Proceedings of Computational Learning Theory (COLT), 2003.
 
9
L. Breiman and J.H Friedman. Predicting Multivariate Responses in Multiple Linear Regression. Royal Statistical Society Series B, 1998.
 
10
P.J. Brown and J.V. Zidek. Adaptive Multivariate Ridge Regression. The Annals of Statistics, Vol. 8, No. 1, p. 64--74, 1980.
 
11
 
12
T. Evgeniou, C. Boussios, and G. Zacharia. Generalized Robust Conjoint Estimation. INSEAD Working Paper, 2002.
 
13
T. Evgeniou, M. Pontil, and T. Poggio. Regularization networks and support vector machines. Advances in Computational Mathematics, 13:1--50, 2000.
 
14
B. Heisele, T. Serre, M. Pontil, T. Vetter, and T. Poggio. Categorization by Learning and Combining Object Parts. In: Advances in Neural Information Processing Systems 14, Vancouver, Canada, Vol. 2, 1239--1245, 2002.
 
15
 
16
 
17
 
18
G.R.G. Lanckriet, T. De Bie, N. Cristianini, M.I. Jordan, and W.S. Noble. A framework for genomic data fusion and its application to membrane protein prediction. Technical Report CSD--03--1273, Division of Computer Science, University of California, Berkeley, 2003.
 
19
O.L. Mangasarian. Nonlinear Programming. Classics in Applied Mathematics. SIAM, 1994.
 
20
C.A. Micchelli and M. Pontil. On Learning Vector--Valued Functions. Research Note RN/03/08, Dept of Computer Science, UCL, July 2003.
 
21
D.L. Silver and R.E Mercer. The parallel transfer of task knowledge using dynamic learning rates based on a measure of relatedness. Connection Science, 8, p. 277--294, 1996.
 
22
 
23
 
24
O. Toubia, D.I. Simester, J.R. Hauser, and E. Dahan. Fast Polyhedral Adaptive Conjoint Estimation. Working paper, MIT Sloan School of Management, 2001.
 
25
 
26
G. Wahba. Splines Models for Observational Data. Series in Applied Mathematics, Vol. 59, SIAM, Philadelphia, 1990.

CITED BY  15

Collaborative Colleagues:
Theodoros Evgeniou: colleagues
Massimiliano Pontil: colleagues