ACM Home Page
Please provide us with feedback. Feedback
Algorithm 844: Computing sparse reduced-rank approximations to sparse matrices
Full text PdfPdf (175 KB)
Source ACM Transactions on Mathematical Software (TOMS) archive
Volume 31 ,  Issue 2  (June 2005) table of contents
Pages: 252 - 269  
Year of Publication: 2005
ISSN:0098-3500
Authors
Michael W. Berry  University of Tennessee, Knoxville, Knoxville, TN
Shakhina A. Pulatova  University of Tennessee, Knoxville, Knoxville, TN
G. W. Stewart  University of Maryland, College Park, College Park, MD
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 21,   Downloads (12 Months): 132,   Citation Count: 4
Additional Information:

appendices and supplements   abstract   references   cited by   index terms   reviews   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1067967.1067972
What is a DOI?

APPENDICES and SUPPLEMENTS
Zip844.zip (9 KB)
Software for "Computing sparse reduced-rank approximations to sparse matrices"


ABSTRACT

In many applications---latent semantic indexing, for example---it is required to obtain a reduced rank approximation to a sparse matrix A. Unfortunately, the approximations based on traditional decompositions, like the singular value and QR decompositions, are not in general sparse. Stewart [(1999), 313--323] has shown how to use a variant of the classical Gram--Schmidt algorithm, called the quasi--Gram-Schmidt--algorithm, to obtain two kinds of low-rank approximations. The first, the SPQR, approximation, is a pivoted, Q-less QR approximation of the form (XR11−1)(R11 R12), where X consists of columns of A. The second, the SCR approximation, is of the form the form AXTYT, where X and Y consist of columns and rows A and T, is small. In this article we treat the computational details of these algorithms and describe a MATLAB implementation.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
Berry, M. W. and Martin, D. I. 2004. Principal component analysis for information retrieval. In Handbook of Parallel Computing and Statistics. Marcel Dekker, New York. To appear.
 
5
Björck, Å. 1996. Numerical Methods for Least Squares Problems. SIAM, Philadelphia.
 
6
Deerwester, S., Dumais, S., Furnas, G., Landauer, T., and Harshman, R. 1990. Indexing by latent semantic analysis. J. Amer. Soc. Info. Sci. 41, 391--407.
 
7
Jiang, P. and Berry, M. W. 2000. Solving total least squares problems in information retrieval. Lin. Alg. Appl. 316, 137--156.
 
8
Saad, Y. 1994. SPARSEKIT: A basic tool kit for sparse matrix computations. Available at www-users.cs.umn.edu/~saad/software/SPARSKIT/sparskit.html.
 
9
Stewart, G. W. 1980. The efficient generation of random orthogonal matrices with an application to condition estimators. SIAM J. Num. Anal. 17, 403--409.
 
10
Stewart, G. W. 1998. Matrix Algorithms I: Basic Decompositions. SIAM, Philadelphia.
 
11
Stewart, G. W. 1999. Four algorithms for the the efficient computation of truncated pivoted qr approximations to a sparse matrix. Numerische Mathematik 83, 313--323.
 
12
 
13
Stewart, G. W. 2004. Error analysis of the quasi-Gram--Schmidt algorithm. Tech. Rep. CMSC TR-4572, Department of Computer Science, University of Maryland.
 
14
Stuart, G. W. and Berry, M. W. 2003. A comprehensive whole genome bacterial phylogeny using correlated peptide motifs defined in a high dimensional vector space. J. Bioinformatics and Computational Bio. 1, 475--493.
 
15



REVIEWS

"Zahari Zlatev : Reviewer"

Reduced-rank approximations of sparse matrices lead to savings in both storage and computing time. Therefore, the use of such approximations is crucial in efforts to handle very large sparse matrices that arise in several fields of science and eng  more...


"Tugrul Dayar : Reviewer"

Obtaining a sparse reduced-rank approximation for a large, sparse rectangular matrix is a problem that comes up in many application areas, one of which is latent semantic indexing. Therein, the objective is to compute a document vector correspondi  more...

Collaborative Colleagues:
Michael W. Berry: colleagues
Shakhina A. Pulatova: colleagues
G. W. Stewart: colleagues