|
ABSTRACT
Recent developments in programmable, highly parallel Graphics Processing Units (GPUs) have enabled high performance implementations of machine learning algorithms. We describe a solver for Support Vector Machine training running on a GPU, using the Sequential Minimal Optimization algorithm and an adaptive first and second order working set selection heuristic, which achieves speedups of 9-35x over LIBSVM running on a traditional processor. We also present a GPU-based system for SVM classification which achieves speedups of 81-138x over LIBSVM (5-24x over our own CPU based SVM classifier).
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Asanović, K., Bodik, R., Catanzaro, B. C., Gebis, J. J., Husbands, P., Keutzer, K., Patterson, D. A., Plishker, W. L., Shalf, J., Williams, S. W., & Yelick, K. A. (2006). The Landscape of Parallel Computing Research: A View from Berkeley (Technical Report UCB/EECS-2006-183). EECS Department, University of California, Berkeley.
|
| |
2
|
Asuncion, A., & Newman, D. (2007). UCI machine learning repository.
|
| |
3
|
|
| |
4
|
Cao, L., Keerthi, S., Ong, C.-J., Zhang, J., Periyathamby, U., Fu, X. J., & Lee, H. (2006). Parallel sequential minimal optimization for the training of support vector machines. IEEE Transactions on Neural Networks, 17, 1039--1049.
|
| |
5
|
Chu, C.-T., Kim, S. K., Lin, Y.-A., Yu, Y., Bradski, G., Ng, A. Y., & Olukotun, K. (2007). Map-reduce for machine learning on multicore. In B. Schöölkopf, J. Platt and T. Hoffman (Eds.), Advances in neural information processing systems 19, 281--288. Cambridge, MA: MIT Press.
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
Ferreira, L. V., Kaskurewicz, E., & Bhaya, A. (2006). Parallel implementation of gradient-based neural networks for svm training. International Joint Conference on Neural Networks.
|
| |
11
|
Graf, H. P., Cosatto, E., Bottou, L., Dourdanovic, I., & Vapnik, V. (2005). Parallel support vector machines: The cascade svm. In L. K. Saul, Y. Weiss and L. Bottou (Eds.), Advances in neural information processing systems 17, 521--528. Cambridge, MA: MIT Press.
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86, 2278--2324.
|
| |
16
|
Nvidia (2007). Nvidia CUDA. http://nvidia.com/cuda.
|
| |
17
|
Osuna, E., Freund, R., & Girosi, F. (1997). An improved training algorithm for support vector machines. Neural Networks for Signal Processing {1997} VII. Proceedings of the 1997 IEEE Workshop, 276--285.
|
| |
18
|
|
| |
19
|
|
 |
20
|
Gang Wu , Edward Chang , Yen Kuang Chen , Christoper Hughes, Incremental approximate matrix factorization for speeding up support vector machines, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, August 20-23, 2006, Philadelphia, PA, USA
[doi> 10.1145/1150402.1150500]
|
| |
21
|
|
CITED BY 3
|
|
Rajat Raina , Anand Madhavan , Andrew Y. Ng, Large-scale deep unsupervised learning using graphics processors, Proceedings of the 26th Annual International Conference on Machine Learning, p.873-880, June 14-18, 2009, Montreal, Quebec, Canada
|
|
|
|
|
|
|
|