ACM Home Page
Please provide us with feedback. Feedback
Matrix approximation and projective clustering via volume sampling
Full text PdfPdf (262 KB)
Source Symposium on Discrete Algorithms archive
Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algorithm table of contents
Miami, Florida
Pages: 1117 - 1126  
Year of Publication: 2006
ISBN:0-89871-605-5
Authors
Amit Deshpande  CSAIL, MIT
Luis Rademacher  CSAIL, MIT
Santosh Vempala  CSAIL, MIT
Grant Wang  CSAIL, MIT
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
: SIAM Activity Group on Discrete Mathematics
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 60,   Citation Count: 9
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1109557.1109681
What is a DOI?

ABSTRACT

Frieze et al. [17] proved that a small sample of rows of a given matrix A contains a low-rank approximation D that minimizes ||A - D||F to within small additive error, and the sampling can be done efficiently using just two passes over the matrix [12]. In this paper, we generalize this result in two ways. First, we prove that the additive error drops exponentially by iterating the sampling in an adaptive manner. Using this result, we give a pass-efficient algorithm for computing low-rank approximation with reduced additive error. Our second result is that using a natural distribution on subsets of rows (called volume sampling), there exists a subset of k rows whose span contains a factor (k + 1) relative approximation and a subset of k + k(k + 1)/ε rows whose span contains a 1+ε relative approximation. The existence of such a small certificate for multiplicative low-rank approximation leads to a PTAS for the following projective clustering problem: Given a set of points P in Rd, and integers k, j, find a set of j subspaces F1, . . ., Fj, each of dimension at most k, that minimize Σp∈Pmini d(p, Fi)2.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
P. Agarwal, S. Har-Peled, K. Varadajan, Geometric approximations via coresets. Manuscript, 2004. http://valis.cs.uiuc.edu/~sariel/papers/04/survey/.
4
5
6
 
7
 
8
M. Artin. Algebra, Prentice-Hall, 1991.
9
10
11
 
12
 
13
 
14
P. Drineas, R. Kannan, M. Maloney, Fast Monte Carlo algorithms for matrices II: computing a low-rank approximation to a matrix. Yale University Technical Report, YALEU/DCS/TR-1270, 2004.
 
15
M. Effros, L. J. Schulman, Deterministic clustering with data nets, ECCC TR04--050, 2004.
 
16
J. Feigenbaum, S. Kannan, A. McGregor, S. Suri, J. Zhang, On graph problems in a semi-streaming model. Proceedings of the 31st ICALP, 2004.
17
 
18
G. Golub, C. Van Loan, Matrix Computations. John Hopkins University Press, third edition, 1996.
 
19
S. A. Goreinov, E. E. Tyrtyshnikov, The maximal-volume concept in approximation by low-rank matrices Contemporary Mathematics, Vol. 280 (2001), 47--51.
20
21
22
 
23
M. Henzinger, P. Raghavan, S. Rajagopalan, Computing on data streams. Technical Note 1998--011, Digital Systems Research Center, Palo Alto, CA, May 1998.
24
 
25
 
26
J. Matoušek, On approximate geometric k-clustering. Discrete and Computational Geometry, 61--84, 2000.
 
27
N. Megiddo, A. Tamir, On the complexity of locating linear facilities in the plane. Operations Research Letters, 1 (1982), 194--197.
28
29
 
30
L. Rademacher, S. Vempala, G. Wang, Matrix approximation and projective clustering via iterative sampling. MIT-LCS-TR-983, 2005.

CITED BY  9

Collaborative Colleagues:
Amit Deshpande: colleagues
Luis Rademacher: colleagues
Santosh Vempala: colleagues
Grant Wang: colleagues