ACM Home Page
Please provide us with feedback. Feedback
Integrating clustering and multi-document summarization to improve document understanding
Full text PdfPdf (173 KB)
Source
Conference on Information and Knowledge Management archive
Proceeding of the 17th ACM conference on Information and knowledge management table of contents
Napa Valley, California, USA
POSTER SESSION: Poster session 2/information retrieval table of contents
Pages 1435-1436  
Year of Publication: 2008
ISBN:978-1-59593-991-3
Authors
Dingding Wang  Florida International University, Miami, FL, USA
Shenghuo Zhu  NEC Laboratories America, Cupertino, CA, USA
Tao Li  Florida International University, Miami, FL, USA
Yun Chi  NEC Laboratories America, Cupertino, CA, USA
Yihong Gong  NEC Laboratories America, Cupertino, CA, USA
Sponsors
ACM: Association for Computing Machinery
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 16,   Downloads (12 Months): 178,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1458082.1458319
What is a DOI?

ABSTRACT

Document understanding techniques such as document clustering and multi-document summarization have been receiving much attention in recent years. Current document clustering methods usually represent documents as a term-document matrix and perform clustering algorithms on it. Although these clustering methods can group the documents satisfactorily, it is still hard for people to capture the meanings of the documents since there is no satisfactory interpretation for each document cluster. In this paper, we propose a new language model to simultaneously cluster and summarize the documents. By utilizing the mutual influence of the document clustering and summarization, our method makes (1) a better document clustering method with more meaningful interpretation and (2) a better document summarization method taking the document context information into consideration.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
H. Cho, I. Dhillon, Y. Guan, and S. Sra. Minimum sum squared residue co-clustering. In Proceedings of SIAM Data Mining 2004.
2
3
4
 
5
D. D. Lee and H. S. Seung. Algorithms for non-negative matrix factorization. In Proceedings of NIPS 2001.
6

Collaborative Colleagues:
Dingding Wang: colleagues
Shenghuo Zhu: colleagues
Tao Li: colleagues
Yun Chi: colleagues
Yihong Gong: colleagues