|
ABSTRACT
Low-rank matrix approximation is an effective tool in alleviating the memory and computational burdens of kernel methods and sampling, as the mainstream of such algorithms, has drawn considerable attention in both theory and practice. This paper presents detailed studies on the Nyström sampling scheme and in particular, an error analysis that directly relates the Nyström approximation quality with the encoding powers of the landmark points in summarizing the data. The resultant error bound suggests a simple and efficient sampling scheme, the k-means clustering algorithm, for Nyström low-rank approximation. We compare it with state-of-the-art approaches that range from greedy schemes to probabilistic sampling. Our algorithm achieves significant performance gains in a number of supervised/unsupervised learning tasks including kernel PCA and least squares SVM.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
 |
3
|
|
| |
4
|
Baker, C. (1977). The numerical treatment of integral equations. Oxford: Clarendon Press.
|
| |
5
|
Belkin, M., & Niyogi, P. (2002). Laplacian eigenmaps and spectral techniques for embedding and clustering. Advances in Neural Information Processing Systems 14.
|
| |
6
|
Drineas, P., Drinea, E., & Huggins, P. (2003). An experimental evaluation of a Monte-Carlo algorithm for singular value decomposition. Proceedings of 8th Panhellenic Conference on Informatics (pp. 279--296).
|
| |
7
|
|
| |
8
|
Elkan, E. (2003). Using the triangular inequality to accelerate k-means. Proceedings of the 21th International Conference on Machine Learning (pp. 147--153).
|
| |
9
|
|
| |
10
|
|
| |
11
|
|
| |
12
|
|
| |
13
|
Golub, G., & Van Loan, C. (1996). Matrix computations. Johns Hopkins University Press. 3rd edition.
|
| |
14
|
Tapas Kanungo , David M. Mount , Nathan S. Netanyahu , Christine D. Piatko , Ruth Silverman , Angela Y. Wu, An Efficient k-Means Clustering Algorithm: Analysis and Implementation, IEEE Transactions on Pattern Analysis and Machine Intelligence, v.24 n.7, p.881-892, July 2002
[doi> 10.1109/TPAMI.2002.1017616]
|
| |
15
|
Lawrence, N. Seeger, M., & Herbrich, R. (2003). Fast sparse Gaussian process methods: The informative vector machine. Advances in Neural Information Processing Systems. (pp. 625--632). MIT Press.
|
| |
16
|
Ouimet, M., & Bengio, Y. (2005). Greedy spectral embedding. Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics (pp. 253--260).
|
| |
17
|
Platt, J. C. (2005). Fastmap, MetricMap, and Landmark MDS are all Nyström algorithms. Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics (pp. 261--268).
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
|
| |
22
|
Williams, C., & Seeger, M. (2001). Using the Nyströöm method to speed up kernel machines. Advances in Neural Information Processing Systems 13 (pp. 682--688).
|
 |
23
|
|
CITED BY 2
|
|
Kai Zhang , James T. Kwok , Bahram Parvin, Prototype vector machine for large scale semi-supervised learning, Proceedings of the 26th Annual International Conference on Machine Learning, p.1233-1240, June 14-18, 2009, Montreal, Quebec, Canada
|
|
|
Sanjiv Kumar , Mehryar Mohri , Ameet Talwalkar, On sampling-based approximate spectral decomposition, Proceedings of the 26th Annual International Conference on Machine Learning, p.553-560, June 14-18, 2009, Montreal, Quebec, Canada
|
|