|
ABSTRACT
Logistic Regression is a well-known classification method that has been used widely in many applications of data mining, machine learning, computer vision, and bioinformatics. Sparse logistic regression embeds feature selection in the classification framework using the l1-norm regularization, and is attractive in many applications involving high-dimensional data. In this paper, we propose Lassplore for solving large-scale sparse logistic regression. Specifically, we formulate the problem as the l1-ball constrained smooth convex optimization, and propose to solve the problem using the Nesterov's method, an optimal first-order black-box method for smooth convex optimization. One of the critical issues in the use of the Nesterov's method is the estimation of the step size at each of the optimization iterations. Previous approaches either applies the constant step size which assumes that the Lipschitz gradient is known in advance, or requires a sequence of decreasing step size which leads to slow convergence in practice. In this paper, we propose an adaptive line search scheme which allows to tune the step size adaptively and meanwhile guarantees the optimal convergence rate. Empirical comparisons with several state-of-the-art algorithms demonstrate the efficiency of the proposed Lassplore algorithm for large-scale problems.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
U. Alon, N. Barkai, D. A. Notterman, K. Gish, S.Ybarra, D.Mack, andA. J. Levine. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Cell Biology, 96:6745--6750, 1999.
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
| |
5
|
M. Chang, W. Yih, and C. Meek. A logistic regression model for detecting prominences. In ACM SIGKDD International conference on Knowledge Discovery and Data mining, 1996.
|
 |
6
|
John Duchi , Shai Shalev-Shwartz , Yoram Singer , Tushar Chandra, Efficient projections onto the l1-ball for learning in high dimensions, Proceedings of the 25th international conference on Machine learning, p.272-279, July 05-09, 2008, Helsinki, Finland
[doi> 10.1145/1390156.1390191]
|
| |
7
|
B. Efron, T. Hastie, I. Johnstone, and R. Tibshirani. Least angle regression. Annals of Statistics, 32(2):407--499, 2004.
|
| |
8
|
S. Eyheramendy, E. Genkin, W. Ju, D.D. Lewis, and D. Madigan. Sparse bayesian classifiers for text categorization. Technical report.
|
| |
9
|
J. Friedman, T. Hastie, and R. Tibshirani. Additive logistic regression: a statistical view of boosting. Annals of Statistics, 28:337--407, 2000.
|
| |
10
|
J. Friedman, T. Hastie, and R. Tibshirani. Regularized paths for generalized linear models via coordinate descent. Technical report, Department of Statistics, Stanford University, 2008.
|
| |
11
|
W. Fu. Penalized regressions: the bridge versus the lasso. Journal of Computational and Graphical Statistics, 7:397--416, 1998.
|
| |
12
|
T. R. Golub, D. K. Slonim, P. Tamayo, C. Huard, M. Gaasenbeek, J. P.Mesirov, H. Coller, M. L. Loh, J. R. Downing, M. A. Caligiuri, C. D. Bloomfield, and E. S. Lander. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science, 286:531--537, 1999.
|
| |
13
|
E.T. Hale, W. Yin, and Y. Zhang. A fixed-point continuation method for l<sub>1</sub>-regularized minimization with applications to compressed sensing. Technical report, CAAM TR07-07, 2007.
|
| |
14
|
|
| |
15
|
|
| |
16
|
|
| |
17
|
|
| |
18
|
Sun-In Lee , Honglak Lee , Pieter Abbeel , Andrew Y. Ng, EfficientL1regularized logistic regression, Proceedings of the 21st national conference on Artificial intelligence, p.401-408, July 16-20, 2006, Boston, Massachusetts
|
| |
19
|
|
| |
20
|
|
 |
21
|
|
| |
22
|
A. Maghbouleh. A logistic regression model for detecting prominences. In The Fourth International Conference on Spoken Language, 1996.
|
| |
23
|
T. P. Minka. A comparison of numerical optimizers for logistic regression. Technical report, 2007.
|
| |
24
|
A. Nemirovski. Efficient methods in convex programming. Lecture Notes, 1994.
|
| |
25
|
Y. Nesterov. Introductory Lectures on Convex Optimization: A Basic Course. Kluwer Academic Publishers, 2003.
|
 |
26
|
Andrew Y. Ng, Feature selection, L1 vs. L2 regularization, and rotational invariance, Proceedings of the twenty-first international conference on Machine learning, p.78, July 04-08, 2004, Banff, Alberta, Canada
[doi> 10.1145/1015330.1015435]
|
| |
27
|
M.Y. Park and T. Hastie. l<sub>1</sub> regularized path algorithm for generalized linear models. Journal of the Royal Statistical Society: Series B, 69:659--677, 2007.
|
| |
28
|
|
| |
29
|
|
| |
30
|
|
| |
31
|
S. K. Shevade and S. S. Keerthi. A simple and efficient algorithm for gene selection using sparse logistic regression. Bioinformatics, 19(17):2246--2253, 2003.
|
| |
32
|
J. Shi, W. Yin, S. Osher, and P. Sajda. A fast algorithm for large scale l<sub>1</sub>-regularized logistic regression. Technical report, CAAM TR08-07, 2008.
|
| |
33
|
M. West, C. Blanchette, H. Dressman, E. Huang, S. Ishida, R. Spang,H. Zuzan, J. A. Olso, J. R. Marks, and J. R. Nevins. Predicting the clinical status of human breast cancer by using gene expression profiles. National Academy of Sciences, 98(20):11462--11467, 2001.
|
| |
34
|
J. Zhu and T. Hastie. Kernel logistic regression and the import vector machine. Journal of Computational and Graphical Statistics, 14(1):1081--1088, 2001.
|
| |
35
|
J. Zhu and T. Hastie. Classification of gene microarrays by penalized logistic regression. Biostatistics, 5(3):427--443, 2004.
|
|