|
ABSTRACT
While classical kernel-based classifiers are based on a single kernel, in practice it is often desirable to base classifiers on combinations of multiple kernels. Lanckriet et al. (2004) considered conic combinations of kernel matrices for the support vector machine (SVM), and showed that the optimization of the coefficients of such a combination reduces to a convex optimization problem known as a quadratically-constrained quadratic program (QCQP). Unfortunately, current convex optimization toolboxes can solve this problem only for a small number of kernels and a small number of data points; moreover, the sequential minimal optimization (SMO) techniques that are essential in large-scale implementations of the SVM cannot be applied because the cost function is non-differentiable. We propose a novel dual formulation of the QCQP as a second-order cone programming problem, and show how to exploit the technique of Moreau-Yosida regularization to yield a formulation to which SMO techniques can be applied. We present experimental results that show that our SMO-based algorithm is significantly more efficient than the general-purpose interior point methods available in current optimization toolboxes.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Andersen, E. D., & Andersen, K. D. (2000). The MOSEK interior point optimizer for linear programming: an implementation of the homogeneous algorithm. High Perf. Optimization (pp. 197--232).
|
| |
2
|
Bertsekas, D. (1995). Nonlinear programming. Nashua, NH: Athena Scientific.
|
| |
3
|
|
| |
4
|
Brent, R. P. (1973). Algorithms for minimization without derivatives. Englewood Cliffs, NJ: Prentice-Hall.
|
| |
5
|
|
| |
6
|
Grandvalet, Y., & Canu, S. (2003). Adaptive scaling for feature selection in SVMs. Neural Information Processing Systems. Cambridge, MA: MIT Press.
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
Lobo, M. S., Vandenberghe, L., Boyd, S., & Léébret, H. (1998). Applications of second-order cone programming. Lin. Alg. and its Applications, 284, 193--228.
|
| |
12
|
Ong, S., Smola, A. J., & Williamson, R. C. (2003). Hyperkernels. Neural Information Processing Systems. Cambridge, MA: MIT Press.
|
| |
13
|
|
CITED BY 37
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Andreas Argyriou , Raphael Hauser , Charles A. Micchelli , Massimiliano Pontil, A DC-programming algorithm for kernel selection, Proceedings of the 23rd international conference on Machine learning, p.41-48, June 25-29, 2006, Pittsburgh, Pennsylvania
|
|
|
|
|
|
|
|
|
|
|
|
Jianhui Chen , Zheng Zhao , Jieping Ye , Huan Liu, Nonlinear adaptive distance metric learning for clustering, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, August 12-15, 2007, San Jose, California, USA
|
|
|
|
|
|
|
|
|
Alain Rakotomamonjy , Francis Bach , Stéphane Canu , Yves Grandvalet, More efficiency in multiple kernel learning, Proceedings of the 24th international conference on Machine learning, p.775-782, June 20-24, 2007, Corvalis, Oregon
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Wenyuan Dai , Qiang Yang , Gui-Rong Xue , Yong Yu, Self-taught clustering, Proceedings of the 25th international conference on Machine learning, p.200-207, July 05-09, 2008, Helsinki, Finland
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Zenglin Xu , Rong Jin , Jieping Ye , Michael R. Lyu , Irwin King, Non-monotonic feature selection, Proceedings of the 26th Annual International Conference on Machine Learning, p.1145-1152, June 14-18, 2009, Montreal, Quebec, Canada
|
|
|
Marius Kloft , Ulf Brefeld , Patrick Düessel , Christian Gehl , Pavel Laskov, Automatic feature selection for anomaly detection, Proceedings of the 1st ACM workshop on Workshop on AISec, October 27-27, 2008, Alexandria, Virginia, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Matthieu Kowalski , Marie Szafranski , Liva Ralaivola, Multiple indefinite kernel learning with mixed norm regularization, Proceedings of the 26th Annual International Conference on Machine Learning, p.545-552, June 14-18, 2009, Montreal, Quebec, Canada
|
|
|
|
|
|
Yu-Feng Li , James T. Kwok , Zhi-Hua Zhou, Semi-supervised learning using label mean, Proceedings of the 26th Annual International Conference on Machine Learning, p.633-640, June 14-18, 2009, Montreal, Quebec, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|