|
ABSTRACT
Support Vector Machines (SVMs) suffer from an O(n2) training cost, where n denotes the number of training instances. In this paper, we propose an algorithm to select boundary instances as training data to substantially reduce n. Our proposed algorithm is motivated by the result of (Burges, 1999) that, removing non-support vectors from the training set does not change SVM training results. Our algorithm eliminates instances that are likely to be non-support vectors. In the concept-independent preprocessing step of our algorithm, we prepare nearest-neighbor lists for training instances. In the concept-specific sampling step, we can then effectively select useful training data for each target concept. Empirical studies show our algorithm to be effective in reducing n, outperforming other competing downsampling algorithms without significantly compromising testing accuracy.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Achlioptas, D., McSherry, F., & Schöölkopf, B. (2002). Sampling techniques for kernel methods. Advances in Neural Information Proc. Systems.
|
| |
2
|
|
| |
3
|
|
| |
4
|
Chang, C.-C., & Lin, C.-J. (2001). LIBSVM: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/cjlin/libsvm.
|
| |
5
|
|
| |
6
|
|
| |
7
|
Graf, H. P., Cosatto, E., Bottou, L., Dourdanovic, I., & Vapnik, V. (2005). Parallel support vector machines: The cascade svm. In L. K. Saul, Y. Weiss and L. Bottou (Eds.), Advances in neural information processing systems 17, 521--528. MIT Press.
|
| |
8
|
|
| |
9
|
|
| |
10
|
Lawrence, N. D., Seeger, M., & Herbrich, R. (2003). Fast sparse gaussian process methods: the informative vector machine. S. Becker, S. Thrun and K. Obermayer (eds) Advances in Neural Information Processing Systems. MIT Press.
|
| |
11
|
LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86, 2278--2324.
|
| |
12
|
Liu, T., Moore, A. W., Gray, A., & Yang, K. (2005). An investigation of practical approximate nearest neighbor algorithms. In L. K. Saul, Y. Weiss and L. Bottou (Eds.), Advances in neural information processing systems 17, 825--832. Cambridge, MA: MIT Press.
|
| |
13
|
Osuna, E., Freund, R., & Girosi, F. (1997). An improved training algorithm for support vector machines. IEEE Workshop on Neural Networks for Signal Processing.
|
 |
14
|
Dmitry Pavlov , Darya Chudova , Padhraic Smyth, Towards scalable support vector machines using squashing, Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, p.295-299, August 20-23, 2000, Boston, Massachusetts, United States
[doi> 10.1145/347090.347155]
|
| |
15
|
Platt, J. (1998). Sequential minimal optimization: A fast algorithm for training support vector machines (Technical Report). Microsoft Research.
|
| |
16
|
|
| |
17
|
|
| |
18
|
|
 |
19
|
|
|