ABSTRACT
Currently, most research on nonnegative matrix factorization (NMF) focuses on the 2-factor $X=FG^T$ factorization. We provide a systematic analysis of 3-factor $X=FSG^T$ NMF. While unconstrained 3-factor NMF is equivalent to unconstrained 2-factor NMF, constrained 3-factor NMF brings new features to constrained 2-factor NMF. We study the orthogonality constraint because it leads to a rigorous clustering interpretation. We provide new rules for updating $F$, $S$, $G$ and prove the convergence of these algorithms. Experiments on 5 datasets and a real-world case study are performed to show the ability of bi-orthogonal 3-factor NMF to cluster rows and columns of the input data matrix simultaneously. We provide a new approach to evaluating the quality of clustering on words using class aggregate distribution and multi-peak distribution. We also provide an overview of various NMF extensions and examine their relationships.
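The equivalence claim for the unconstrained case can be checked on a toy example. The sketch below is illustrative only and is not the paper's algorithm: the matrices `F`, `S`, `G` and the helper names `matmul`, `transpose`, `gram` are made up for this example. It shows that $S$ can be absorbed into $F$ without changing the product, and that the absorbed factor no longer has orthonormal columns, which is why the orthogonality constraint makes the 3-factor form genuinely different.

```python
def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(col) for col in zip(*a)]

def gram(a):
    """Return A^T A; equals the identity when A has orthonormal columns."""
    return matmul(transpose(a), a)

# Hypothetical nonnegative factors (assumed values, chosen for illustration).
F = [[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]]              # 3 x 2, F^T F = I
S = [[2.0, 1.0], [0.0, 3.0]]                          # 2 x 2
G = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 0.5]]  # 4 x 2

# 3-factor product X = F S G^T.
X3 = matmul(F, matmul(S, transpose(G)))

# Absorb S into F: F2 = F S is still nonnegative, so X = F2 G^T is a
# valid unconstrained 2-factor NMF with the same reconstruction.
F2 = matmul(F, S)
X2 = matmul(F2, transpose(G))

same_product = all(abs(x - y) < 1e-12
                   for r3, r2 in zip(X3, X2) for x, y in zip(r3, r2))

identity = [[1.0, 0.0], [0.0, 1.0]]
f_is_orthogonal = gram(F) == identity    # holds for the original F
f2_is_orthogonal = gram(F2) == identity  # fails once S is absorbed

print(same_product, f_is_orthogonal, f2_is_orthogonal)
```

With the constraint $F^TF=I$ in place, the middle factor $S$ has to stay separate, since folding it into $F$ breaks the constraint.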