ACM Home Page
Please provide us with feedback. Feedback
Large-scale sparse logistic regression
Full text MovMov (25:58),  PdfPdf (543 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Paris, France
SESSION: Research track papers table of contents
Pages 547-556  
Year of Publication: 2009
ISBN:978-1-60558-495-9
Authors
Jun Liu  Arizona State University, Tempe, AZ, USA
Jianhui Chen  Arizona State University, Tempe, AZ, USA
Jieping Ye  Arizona State University, Tempe, AZ, USA
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 58,   Downloads (12 Months): 218,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1557019.1557082
What is a DOI?

ABSTRACT

Logistic Regression is a well-known classification method that has been used widely in many applications of data mining, machine learning, computer vision, and bioinformatics. Sparse logistic regression embeds feature selection in the classification framework using the l1-norm regularization, and is attractive in many applications involving high-dimensional data. In this paper, we propose Lassplore for solving large-scale sparse logistic regression. Specifically, we formulate the problem as the l1-ball constrained smooth convex optimization, and propose to solve the problem using the Nesterov's method, an optimal first-order black-box method for smooth convex optimization. One of the critical issues in the use of the Nesterov's method is the estimation of the step size at each of the optimization iterations. Previous approaches either applies the constant step size which assumes that the Lipschitz gradient is known in advance, or requires a sequence of decreasing step size which leads to slow convergence in practice. In this paper, we propose an adaptive line search scheme which allows to tune the step size adaptively and meanwhile guarantees the optimal convergence rate. Empirical comparisons with several state-of-the-art algorithms demonstrate the efficiency of the proposed Lassplore algorithm for large-scale problems.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
U. Alon, N. Barkai, D. A. Notterman, K. Gish, S.Ybarra, D.Mack, andA. J. Levine. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Cell Biology, 96:6745--6750, 1999.
 
2
 
3
 
4
 
5
M. Chang, W. Yih, and C. Meek. A logistic regression model for detecting prominences. In ACM SIGKDD International conference on Knowledge Discovery and Data mining, 1996.
6
 
7
B. Efron, T. Hastie, I. Johnstone, and R. Tibshirani. Least angle regression. Annals of Statistics, 32(2):407--499, 2004.
 
8
S. Eyheramendy, E. Genkin, W. Ju, D.D. Lewis, and D. Madigan. Sparse bayesian classifiers for text categorization. Technical report.
 
9
J. Friedman, T. Hastie, and R. Tibshirani. Additive logistic regression: a statistical view of boosting. Annals of Statistics, 28:337--407, 2000.
 
10
J. Friedman, T. Hastie, and R. Tibshirani. Regularized paths for generalized linear models via coordinate descent. Technical report, Department of Statistics, Stanford University, 2008.
 
11
W. Fu. Penalized regressions: the bridge versus the lasso. Journal of Computational and Graphical Statistics, 7:397--416, 1998.
 
12
T. R. Golub, D. K. Slonim, P. Tamayo, C. Huard, M. Gaasenbeek, J. P.Mesirov, H. Coller, M. L. Loh, J. R. Downing, M. A. Caligiuri, C. D. Bloomfield, and E. S. Lander. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science, 286:531--537, 1999.
 
13
E.T. Hale, W. Yin, and Y. Zhang. A fixed-point continuation method for l<sub>1</sub>-regularized minimization with applications to compressed sensing. Technical report, CAAM TR07-07, 2007.
 
14
 
15
 
16
 
17
 
18
 
19
 
20
21
 
22
A. Maghbouleh. A logistic regression model for detecting prominences. In The Fourth International Conference on Spoken Language, 1996.
 
23
T. P. Minka. A comparison of numerical optimizers for logistic regression. Technical report, 2007.
 
24
A. Nemirovski. Efficient methods in convex programming. Lecture Notes, 1994.
 
25
Y. Nesterov. Introductory Lectures on Convex Optimization: A Basic Course. Kluwer Academic Publishers, 2003.
26
 
27
M.Y. Park and T. Hastie. l<sub>1</sub> regularized path algorithm for generalized linear models. Journal of the Royal Statistical Society: Series B, 69:659--677, 2007.
 
28
 
29
 
30
 
31
S. K. Shevade and S. S. Keerthi. A simple and efficient algorithm for gene selection using sparse logistic regression. Bioinformatics, 19(17):2246--2253, 2003.
 
32
J. Shi, W. Yin, S. Osher, and P. Sajda. A fast algorithm for large scale l<sub>1</sub>-regularized logistic regression. Technical report, CAAM TR08-07, 2008.
 
33
M. West, C. Blanchette, H. Dressman, E. Huang, S. Ishida, R. Spang,H. Zuzan, J. A. Olso, J. R. Marks, and J. R. Nevins. Predicting the clinical status of human breast cancer by using gene expression profiles. National Academy of Sciences, 98(20):11462--11467, 2001.
 
34
J. Zhu and T. Hastie. Kernel logistic regression and the import vector machine. Journal of Computational and Graphical Statistics, 14(1):1081--1088, 2001.
 
35
J. Zhu and T. Hastie. Classification of gene microarrays by penalized logistic regression. Biostatistics, 5(3):427--443, 2004.

Collaborative Colleagues:
Jun Liu: colleagues
Jianhui Chen: colleagues
Jieping Ye: colleagues