| Efficient Euclidean projections in linear time |
| Full text |
Pdf
(639 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 382
archive
Proceedings of the 26th Annual International Conference on Machine Learning
table of contents
Montreal, Quebec, Canada
Pages 657-664
Year of Publication: 2009
ISBN:978-1-60558-516-1
|
|
Authors
|
|
Jun Liu
|
Arizona State University, Tempe, AZ
|
|
Jieping Ye
|
Arizona State University, Tempe, AZ
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 28, Citation Count: 1
|
|
|
ABSTRACT
We consider the problem of computing the Euclidean projection of a vector of length n onto a closed convex set including the l1 ball and the specialized polyhedra employed in (Shalev-Shwartz & Singer, 2006). These problems have played building block roles in solving several l1-norm based sparse learning problems. Existing methods have a worst-case time complexity of O(n log n). In this paper, we propose to cast both Euclidean projections as root finding problems associated with specific auxiliary functions, which can be solved in linear time via bisection. We further make use of the special structure of the auxiliary functions, and propose an improved bisection algorithm. Empirical studies demonstrate that the proposed algorithms are much more efficient than the competing ones for computing the projections.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Brent, R. (1971). An algorithm with guaranteed convergence for finding a zero of a function. Computer Journal, 14, 422--425.
|
| |
3
|
Candès, E., & Wakin, M. (2008). An introduction to compressive sampling. IEEE Signal Processing Magazine, 25, 21--30.
|
| |
4
|
|
| |
5
|
Donoho, D. (2006). Compressed sensing. IEEE Transactions on Information Theory, 52, 1289--1306.
|
 |
6
|
John Duchi , Shai Shalev-Shwartz , Yoram Singer , Tushar Chandra, Efficient projections onto the l1-ball for learning in high dimensions, Proceedings of the 25th international conference on Machine learning, p.272-279, July 05-09, 2008, Helsinki, Finland
[doi> 10.1145/1390156.1390191]
|
| |
7
|
|
| |
8
|
Nemirovski, A. (1994). Efficient methods in convex programming. Lecture Notes.
|
| |
9
|
Nesterov, Y. (2003). Introductory lectures on convex optimization: A basic course. Kluwer Academic Publishers.
|
 |
10
|
Andrew Y. Ng, Feature selection, L1 vs. L2 regularization, and rotational invariance, Proceedings of the twenty-first international conference on Machine learning, p.78, July 04-08, 2004, Banff, Alberta, Canada
[doi> 10.1145/1015330.1015435]
|
| |
11
|
Shalev-Shwartz, S. (2007). Online learning: Theory algorithms, and applications. Doctoral dissertation, Hebrew University.
|
| |
12
|
|
| |
13
|
Shalev-Shwartz, S., & Srebro, N. (2008). Iterative loss minimization with l1-norm constraint and guarantees on sparsity (Technical Report). TTI.
|
| |
14
|
Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society Series B, 58, 267--288.
|
|