ACM Home Page
Please provide us with feedback. Feedback
Generalized spectral bounds for sparse LDA
Full text PdfPdf (202 KB)
Source ACM International Conference Proceeding Series; Vol. 148 archive
Proceedings of the 23rd international conference on Machine learning table of contents
Pittsburgh, Pennsylvania
Pages: 641 - 648  
Year of Publication: 2006
ISBN:1-59593-383-2
Authors
Baback Moghaddam  Mitsubishi Electric Research Laboratories (MERL), Cambridge MA
Yair Weiss  The Hebrew University of Jerusalem, Jerusalem, Israel
Shai Avidan  Mitsubishi Electric Research Laboratories (MERL), Cambridge MA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 15,   Downloads (12 Months): 71,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1143844.1143925
What is a DOI?

ABSTRACT

We present a discrete spectral framework for the sparse or cardinality-constrained solution of a generalized Rayleigh quotient. This NP-hard combinatorial optimization problem is central to supervised learning tasks such as sparse LDA, feature selection and relevance ranking for classification. We derive a new generalized form of the Inclusion Principle for variational eigenvalue bounds, leading to exact and optimal sparse linear discriminants using branch-and-bound search. An efficient greedy (approximate) technique is also presented. The generalization performance of our sparse LDA algorithms is demonstrated with real-world UCI ML benchmarks and compared to a leading SVM-based gene selection algorithm for cancer classification.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Alon et al. (1999). Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon cancer tissues Cell Biology, 96, 6745--6750.
 
2
 
3
d'Aspremont, A., Ghaoui, L. E., Jordan, M. I., & Lanckriet, G. R. G. (2004). A direct formulation for sparse PCA using semidefinite programming. Advances in Neural Information Processing Systems 17 (pp. 803--809). MIT Press.
 
4
 
5
 
6
 
7
Ko, C., Lee, J., & Queyranne, M. (1995). An exact algorithm for maximum entropy sampling. Operations Research, 43, 684--691.
 
8
Kohavi, R., & John, G. (2003). Wrappers for feature subset selection. Artificial Intelligence Journal, 3, 1157--1182.
 
9
Mika, S., Rätsch, G., & Müüller, K.-R. (2001). A mathematical programming approach to the kernel fisher algorithm. Advances in Neural Information Processing Systems 13 (pp. 591--597). MIT Press.
 
10
Moghaddam, B., Weiss, Y., & Avidan, S. (2006). Spectral Bounds for Sparse PCA: Exact & Greedy Algorithms. Advances in Neural Information Processing Systems 18 (pp. 915--922). MIT Press.
 
11
 
12
 
13
Tibshirani, R. (1995). Regression shrinkage and selection via Lasso. Journal of the Royal Statistical Society B, 58, 267--288.
 
14
Zou, H., Hastie, T., & Tibshirani, R. (2004). Sparse principal component analysis (Technical Report). Statistics Department, Stanford University.


Collaborative Colleagues:
Baback Moghaddam: colleagues
Yair Weiss: colleagues
Shai Avidan: colleagues