ACM Home Page
Please provide us with feedback. Feedback
SVM optimization: inverse dependence on training set size
Full text PdfPdf (349 KB)
Source ICML; Vol. 307 archive
Proceedings of the 25th international conference on Machine learning table of contents
Helsinki, Finland
Pages 928-935  
Year of Publication: 2008
ISBN:978-1-60558-205-4
Authors
Shai Shalev-Shwartz  Toyota Technological Institute at Chicago, Chicago IL
Nathan Srebro  Toyota Technological Institute at Chicago, Chicago IL
Sponsors
: Yahoo!
: Xerox
IBM : IBM
: NSF
Microsoft Research : Microsoft Research
: Machine Learning Journal/Springer
: Pascal
: University of Helsinki
: Federation of Finnish Learned Societies
: Intel Corporation
: Google
: Helsinki Institute for Information Technology
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 70,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390156.1390273
What is a DOI?

ABSTRACT

We discuss how the runtime of SVM optimization should decrease as the size of the training data increases. We present theoretical and empirical results demonstrating how a simple subgradient descent approach indeed displays such behavior, at least for linear kernels.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Bottou, L. (Web Page). Stochastic gradient descent examples. http://leon.bottou.org/projects/sgd.
 
3
Bottou, L., & Bousquet, O. (2008). The tradeoffs of large scale learning. Advances in Neural Information Processing Systems 20.
 
4
Bottou, L., & LeCun, Y. (2004). Large scale online learning. Advances in Neural Information Processing Systems 16.
 
5
Bottou, L., & Lin, C.-J. (2007). Support vector machine solvers. In L. Bottou, O. Chapelle, D. DeCoste and J. Weston (Eds.), Large scale kernel machines. MIT Press.
 
6
7
 
8
Lin, C.-J. (2002). A formal analysis of stopping criteria of decomposition methods for support vector machines. IEEE Transactions on Neural Networks, 13, 1045--1052.
 
9
10
 
11
Smola, A., Vishwanathan, S., & Le, Q. (2008). Bundle methods for machine learning. Advances in Neural Information Processing Systems 20.
 
12
Sridharan, K. (2008). Fast convergence rates for excess regularized risk with application to SVM. http://ttic.uchicago.edu/~karthik/con.pdf.
13


Collaborative Colleagues:
Shai Shalev-Shwartz: colleagues
Nathan Srebro: colleagues