| Generalized spectral bounds for sparse LDA |
| Full text |
Pdf
(202 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 148
archive
Proceedings of the 23rd international conference on Machine learning
table of contents
Pittsburgh, Pennsylvania
Pages: 641 - 648
Year of Publication: 2006
ISBN:1-59593-383-2
|
|
Authors
|
|
Baback Moghaddam
|
Mitsubishi Electric Research Laboratories (MERL), Cambridge MA
|
|
Yair Weiss
|
The Hebrew University of Jerusalem, Jerusalem, Israel
|
|
Shai Avidan
|
Mitsubishi Electric Research Laboratories (MERL), Cambridge MA
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 19, Downloads (12 Months): 63, Citation Count: 4
|
|
|
ABSTRACT
We present a discrete spectral framework for the sparse or cardinality-constrained solution of a generalized Rayleigh quotient. This NP-hard combinatorial optimization problem is central to supervised learning tasks such as sparse LDA, feature selection and relevance ranking for classification. We derive a new generalized form of the Inclusion Principle for variational eigenvalue bounds, leading to exact and optimal sparse linear discriminants using branch-and-bound search. An efficient greedy (approximate) technique is also presented. The generalization performance of our sparse LDA algorithms is demonstrated with real-world UCI ML benchmarks and compared to a leading SVM-based gene selection algorithm for cancer classification.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Alon et al. (1999). Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon cancer tissues Cell Biology, 96, 6745--6750.
|
| |
2
|
|
| |
3
|
d'Aspremont, A., Ghaoui, L. E., Jordan, M. I., & Lanckriet, G. R. G. (2004). A direct formulation for sparse PCA using semidefinite programming. Advances in Neural Information Processing Systems 17 (pp. 803--809). MIT Press.
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
Ko, C., Lee, J., & Queyranne, M. (1995). An exact algorithm for maximum entropy sampling. Operations Research, 43, 684--691.
|
| |
8
|
Kohavi, R., & John, G. (2003). Wrappers for feature subset selection. Artificial Intelligence Journal, 3, 1157--1182.
|
| |
9
|
Mika, S., Rätsch, G., & Müüller, K.-R. (2001). A mathematical programming approach to the kernel fisher algorithm. Advances in Neural Information Processing Systems 13 (pp. 591--597). MIT Press.
|
| |
10
|
Moghaddam, B., Weiss, Y., & Avidan, S. (2006). Spectral Bounds for Sparse PCA: Exact & Greedy Algorithms. Advances in Neural Information Processing Systems 18 (pp. 915--922). MIT Press.
|
| |
11
|
|
| |
12
|
|
| |
13
|
Tibshirani, R. (1995). Regression shrinkage and selection via Lasso. Journal of the Royal Statistical Society B, 58, 267--288.
|
| |
14
|
Zou, H., Hastie, T., & Tibshirani, R. (2004). Sparse principal component analysis (Technical Report). Statistics Department, Stanford University.
|
|