ACM Home Page
Please provide us with feedback. Feedback
Bolasso: model consistent Lasso estimation through the bootstrap
Full text PdfPdf (315 KB)
Source ICML; Vol. 307 archive
Proceedings of the 25th international conference on Machine learning table of contents
Helsinki, Finland
Pages 33-40  
Year of Publication: 2008
ISBN:978-1-60558-205-4
Author
Francis R. Bach  Laboratoire d'Informatique de l'Ecole Normale Supérieure, Paris, France
Sponsors
: Yahoo!
: Xerox
IBM : IBM
: NSF
Microsoft Research : Microsoft Research
: Machine Learning Journal/Springer
: Pascal
: University of Helsinki
: Federation of Finnish Learned Societies
: Intel Corporation
: Google
: Helsinki Institute for Information Technology
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 44,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390156.1390161
What is a DOI?

ABSTRACT

We consider the least-square linear regression problem with regularization by the l1-norm, a problem usually referred to as the Lasso. In this paper, we present a detailed asymptotic analysis of model consistency of the Lasso. For various decays of the regularization parameter, we compute asymptotic equivalents of the probability of correct model selection (i.e., variable selection). For a specific rate decay, we show that the Lasso selects all the variables that should enter the model with probability tending to one exponentially fast, while it selects all other variables with strictly positive probability. We show that this property implies that if we run the Lasso for several bootstrapped replications of a given sample, then intersecting the supports of the Lasso bootstrap estimates leads to consistent model selection. This novel variable selection algorithm, referred to as the Bolasso, is compared favorably to other linear regression methods on synthetic data and datasets from the UCI machine learning repository.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Asuncion, A., & Newman, D. (2007). UCI machine learning repository.
 
2
 
3
Bentkus, V. (2003). On the dependence of the Berry--Esseen bound on dimension. Journal of Statistical Planning and Inference, 113, 385--402.
 
4
Boucheron, S., Lugosi, G., & Bousquet, O. (2004). Concentration inequalities. Advanced Lectures on Machine Learning. Springer.
 
5
 
6
Breiman, L. (1996b). Heuristics of instability and stabilization in model selection. Ann. Stat., 24, 2350--2383.
 
7
Breiman, L. (1998). Arcing classifier. Ann. Stat., 26, 801--849.
 
8
Bühlmann, P. (2006). Boosting for high-dimensional linear models. Ann. Stat., 34, 559--583.
 
9
Efron, B., Hastie, T., Johnstone, I., & Tibshirani, R. (2004). Least angle regression. Ann. Stat., 32, 407.
 
10
Efron, B., & Tibshirani, R. J. (1998). An introduction to the bootstrap. Chapman & Hall.
 
11
Fu, W., & Knight, K. (2000). Asymptotics for Lasso-type estimators. Ann. Stat., 28, 1356--1378.
 
12
Lounici, K. (2008). Sup-norm convergence rate and sign concentration property of Lasso and Dantzig estimators. Electronic Journal of Statistics, 2.
 
13
Meinshausen, N., & Yu, B. (2008). Lasso-type recovery of sparse representations for high-dimensional data. Ann. Stat., to appear.
 
14
Tibshirani, R. (1994). Regression shrinkage and selection via the Lasso. J. Roy. Stat. Soc. B, 58, 267--288.
 
15
Wainwright, M. J. (2006). Sharp thresholds for noisy and high-dimensional recovery of sparsity using l1-constrained quadratic programming (Tech. report 709). Dpt. of Statistics, UC Berkeley.
 
16
Yuan, M., & Lin, Y. (2007). On the non-negative garrotte estimator. J. Roy. Stat. Soc. B, 69, 143--161.
 
17
 
18
Zou, H. (2006). The adaptive Lasso and its oracle properties. J. Am. Stat. Ass., 101, 1418--1429.