ACM Home Page
Please provide us with feedback. Feedback
Fast computation of low rank matrix approximations
Full text PdfPdf (223 KB)
Source Annual ACM Symposium on Theory of Computing archive
Proceedings of the thirty-third annual ACM symposium on Theory of computing table of contents
Hersonissos, Greece
Pages: 611 - 618  
Year of Publication: 2001
ISBN:1-58113-349-9
Authors
Dimitris Achlioptas  Microsoft Research, One Microsoft Way, Redmond, WA
Frank McSherry  Computer Science and Engineering, University of Washington, Seattle, WA
Sponsor
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 17,   Downloads (12 Months): 54,   Citation Count: 25
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/380752.380858
What is a DOI?

ABSTRACT

Given a matrix A it is often desirable to find an approximation to A that has low rank. We introduce a simple technique for accelerating the computation of such approximations when A has strong spectral structure, i.e., when the singular values of interest are significantly greater than those of a random matrix with size and entries similar to A. Our technique amounts to independently sampling and/or quantizing the entries of A, thus speeding up computation by reducing the number of non-zero entries and/or the length of their representation. Our analysis is based on observing that the acts of sampling and quantization can be viewed as adding a random matrix E to A, whose entries are independent random variables with zero-mean and bounded variance. Since, with high probability, E has very weak spectral structure, we can prove that the effect of sampling and quantization nearly vanishes when a low rank approximation to A+E is computed. In fact, the stronger the spectral structure of A, the more of its entries we can afford to discard and, ultimately, the faster we can discover that structure. We give bounds on the quality of our approximation both in the L2 and in the Frobenius norm.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Yossi Azar, Amos Fiat, Anna Karlin, Frank McSherry, and Jared Saia, Data mining through spectral analysis, These Proceedings.
 
2
 
3
 
4
Scott Deerwester, Susan T. Dumais, Thomas K. Landauer, George W. Furnas, and Richard A. Harshman, Indexing by latent semantic analysis, J. Soc. Inf. Sci. 41 (1990), no. 6, 391-407.
 
5
 
6
 
7
Zoltan F.uredi and Janos Komlos, The eigenvalues of random symmetric matrices, Combinatorica 1 (1981), no. 3, 233-241.
 
8
 
9
Ferenc Juhasz, On the spectrum of a random graph, Algebraic methods in graph theory, Vol. I, II (Szeged, 1978), North-Holland, Amsterdam, 1981, pp. 313-316.
10
 
11
Michael Krivelevich and Van H. Vu, On the concentration of eigenvalues of random symmetric matrices, Tech. Report 60, Microsoft Research, 2000.
12
 
13
Matthew Turk and Alex Pentland, Eigenfaces for recognition, J. Cogn. Neurosci. 3 (1991), no. 1, 71-86.
 
14
Eugene P. Wigner, On the distribution of the roots of certain symmetric matrices, Ann. of Math. (2) 67 (1958), 325-327.

CITED BY  25
 
 
 
 
 
 
 
 
 

Collaborative Colleagues:
Dimitris Achlioptas: colleagues
Frank McSherry: colleagues

Peer to Peer - Readers of this Article have also read: