ACM Home Page
Please provide us with feedback. Feedback
Interpretable nonnegative matrix decompositions
Full text PdfPdf (369 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Las Vegas, Nevada, USA
SESSION: Research papers table of contents
Pages 345-353  
Year of Publication: 2008
ISBN:978-1-60558-193-4
Authors
Saara Hyvönen  Fonecta Ltd., Helsinki, Finland
Pauli Miettinen  University of Helsinki, Helsinki, Finland
Evimaria Terzi  IBM Almaden, San Jose, CA, USA
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 184,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1401890.1401935
What is a DOI?

ABSTRACT

A matrix decomposition expresses a matrix as a product of at least two factor matrices. Equivalently, it expresses each column of the input matrix as a linear combination of the columns in the first factor matrix. The interpretability of the decompositions is a key issue in many data-analysis tasks. We propose two new matrix-decomposition problems: the nonnegative CX and nonnegative CUR problems, that give naturally interpretable factors. They extend the recently-proposed column and column-row based decompositions, and are aimed to be used with nonnegative matrices. Our decompositions represent the input matrix as a nonnegative linear combination of a subset of its columns (or columns and rows).

We present two algorithms to solve these problems and provide an extensive experimental evaluation where we assess the quality of our algorithms' results as well as the intuitiveness of nonnegative CX and CUR decompositions. We show that our algorithms return intuitive answers with smaller reconstruction errors than the previously-proposed methods for column and column-row decompositions.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
M. W. Berry et al. Algorithms and applications for approximate nonnegative matrix factorization. Computational Statistics & Data Analysis, 52(1):155--173, 2007.
2
 
3
A. ¸ Civril and M. Magdon-Ismail. Finding maximum volume sub-matrices of a matrix. Technical Report 07-08, Department of Computer Science, Rensselaer Polytechnic Institute, 2007.
 
4
S. C. Deerwester et al. Indexing by latent semantic analysis. J. of the American Society for Information Science, 41(6):391--407, 1990.
 
5
P. Drineas, M. W. Mahoney, and S. Muthukrishnan. Subspace sampling and relative-error matrix approximation: Column-based methods. In APPROX-RANDOM, pages 316--326, 2006.
 
6
 
7
P. Drineas, M. W. Mahoney, and S. Muthukrishnan. Relative-error CUR matrix decompositions, Aug. 2007. arXiv:0708.3696v1 {cs.DS}.
 
8
S. M. Embleton and E. S. Wheeler. Finnish dialect atlas for quantitative studies. J. of Quantitative Linguistics, 4(1-3):99--102, 1997.
 
9
S. M. Embleton and E. S. Wheeler. Computerized dialect atlas of Finnish: Dealing with ambiguity. J. of Quantitative Linguistics, 7(3):227--231, 2000.
 
10
 
11
G. H. Golub and C. F. Van Loan. Matrix Computations. JHU Press, 1996.
 
12
M. K. Kozlov, S. P. Tarasov, and L. G. Hačijan. Polynomial solvability of convex quadratic programming. Soviet Math. Dokl., 20(5):1108--1111, 1979.
 
13

Collaborative Colleagues:
Saara Hyvönen: colleagues
Pauli Miettinen: colleagues
Evimaria Terzi: colleagues