|
ABSTRACT
In Kernel Fisher discriminant analysis (KFDA), we carry out Fisher linear discriminant analysis in a high dimensional feature space defined implicitly by a kernel. The performance of KFDA depends on the choice of the kernel; in this paper, we consider the problem of finding the optimal kernel, over a given convex set of kernels. We show that this optimal kernel selection problem can be reformulated as a tractable convex optimization problem which interior-point methods can solve globally and efficiently. The kernel selection method is demonstrated with some UCI machine learning benchmark examples.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Francis R. Bach , Gert R. G. Lanckriet , Michael I. Jordan, Multiple kernel learning, conic duality, and the SMO algorithm, Proceedings of the twenty-first international conference on Machine learning, p.6, July 04-08, 2004, Banff, Alberta, Canada
[doi> 10.1145/1015330.1015424]
|
 |
2
|
|
 |
3
|
|
| |
4
|
Bousquet, O., & Herrmann, D. (2003). On the complexity of learning the kernel matrix. In Advances in Neural Information Processing Systems, 15, MIT Press.
|
| |
5
|
|
| |
6
|
Crammer, K., Keshet, J., & Singer, Y. (2003). Kernel design using boosting. In Advances in Neural Information Processing Systems, 15, MIT Press.
|
| |
7
|
Cristianini, N., Elisseeff, A., Shawe-Taylor, J., & Kandla, J. (2001). On kernel target alignment. In Advances in Neural Information Processing Systems, 13, pp. 367--373, MIT Press.
|
 |
8
|
Glenn Fung , Murat Dundar , Jinbo Bi , Bharat Rao, A fast iterative algorithm for fisher discriminant using heterogeneous kernels, Proceedings of the twenty-first international conference on Machine learning, p.40, July 04-08, 2004, Banff, Alberta, Canada
[doi> 10.1145/1015330.1015409]
|
| |
9
|
Hastie, T., Tibshirani, R., & Friedman, J. (2001). The elements of statistical learning: Data mining, inference, and prediction. Springer-Verlag.
|
| |
10
|
|
| |
11
|
|
| |
12
|
Charles A. Micchelli , Massimiliano Pontil, Learning the Kernel Function via Regularization, The Journal of Machine Learning Research, 6, p.1099-1125, 9/1/2005
|
| |
13
|
Mika, S., Rätsch, G., & Müüller, K. (2001). A mathematical programming approach to the kernel Fisher algorithm. In Advances in Neural Information Processing Systems, 13, pp. 591--597, MIT Press.
|
| |
14
|
Sebastian Mika , Gunnar Rätsch , Jason Weston , Bernhard Schölkopf , Alex Smola , Klaus-Robert Müller, Constructing Descriptive and Discriminative Nonlinear Features: Rayleigh Coefficients in Kernel Feature Spaces, IEEE Transactions on Pattern Analysis and Machine Intelligence, v.25 n.5, p.623-633, May 2003
[doi> 10.1109/TPAMI.2003.1195996]
|
| |
15
|
Nesterov, Y., & Nemirovsky, A. (1994). Interior-point polynomial methods in convex programming, vol. 13 of Studies in Applied Mathematics. Philadelphia, PA: SIAM.
|
| |
16
|
Newman, D., Hettich, S., Blake, C., & Merz, C. (1998). UCI repository of machine learning databases. Available from www.ics.uci.edu/~mlearn/MLRepository.html.
|
| |
17
|
Cheng Soon Ong , Alexander J. Smola , Robert C. Williamson, Learning the Kernel with Hyperkernels, The Journal of Machine Learning Research, 6, p.1043-1071, 9/1/2005
|
| |
18
|
Pepe, M. (2000). Receiver operating characteristic methodology. Journal of the American Statistical Association, 95, 308--311.
|
| |
19
|
|
| |
20
|
Sturm, J. (2001). Using SeDuMi 1.02, a Matlab toolbox for optimization over symmetric cones. Available from sedumi.mcmaster.ca/.
|
| |
21
|
Toh, K., Tütüncü, R., & Todd, M. (2002). SDPT3 version 3.02. a Matlab software for semidefinite-quadratic-linear programming. Available from www.math.nus.edu.sg/~mattohkc/sdpt3.html.
|
| |
22
|
|
| |
23
|
Xiong, H., Swamy, M., & Ahmad, M. (2005). Optimizing the kernel in the empirical feature space. IEEE Transactions on Neural Networks, 16, 460--474.
|
| |
24
|
|
CITED BY 7
|
|
|
|
|
|
|
|
|
|
|
Jianhui Chen , Zheng Zhao , Jieping Ye , Huan Liu, Nonlinear adaptive distance metric learning for clustering, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, August 12-15, 2007, San Jose, California, USA
|
|
|
|
|
|
|
|
|
Glenn Fung , Sriram Krishnan , R. Bharat Rao , Hui Chen, Learning sparse kernels from 3D surfaces for heart wall motion abnormality detection, Proceedings of the 20th national conference on Innovative applications of artificial intelligence, p.1663-1670, July 13-17, 2008, Chicago, Illinois
|
|