ACM Home Page
Please provide us with feedback. Feedback
Topic modeling with network regularization
Full text PdfPdf (717 KB)
Source
International World Wide Web Conference archive
Proceeding of the 17th international conference on World Wide Web table of contents
Beijing, China
SESSION: Data mining: modeling table of contents
Pages 101-110  
Year of Publication: 2008
ISBN:978-1-60558-085-2
Authors
Qiaozhu Mei  University of Illinois at Urbana-Champaign, Urbana, IL, USA
Deng Cai  University of Illinois at Urbana-Champaign, Urbana, IL, USA
Duo Zhang  University of Illinois at Urbana-Champaign, Urbana, IL, USA
ChengXiang Zhai  University of Illinois at Urbana-Champaign, Urbana, IL, USA
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 36,   Downloads (12 Months): 266,   Citation Count: 11
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1367497.1367512
What is a DOI?

ABSTRACT

In this paper, we formally define the problem of topic modeling with network structure (TMN). We propose a novel solution to this problem, which regularizes a statistical topic model with a harmonic regularizer based on a graph structure in the data. The proposed method bridges topic modeling and social network analysis, which leverages the power of both statistical topic models and discrete regularization. The output of this model well summarizes topics in text, maps a topic on the network, and discovers topical communities. With concrete selection of a topic model and a graph-based regularizer, our model can be applied to text mining problems such as author-topic analysis, community discovery, and spatial text mining. Empirical experiments on two different genres of data show that our approach is effective, which improves text-oriented methods as well as network-oriented methods. The proposed model is general; it can be applied to any text collections with a mixture of topics and an associated network structure.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
 
4
 
5
S. Borgatti, M. Everett, and L. Freeman. Ucinet for windows: Software for social network analysis. Harvard: Analytic Technologies, 2002.
6
 
7
D. A. Cohn and T. Hofmann. The missing link - a probabilistic model of document content and hypertext connectivity. In NIPS, 2000.
 
8
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of Royal Statist. Soc. B, 39:1--38, 1977.
9
10
11
 
12
13
 
14
J. Leskovec, M. McGlohon, C. Faloutsos, N. Glance, and M. Hurst. Cascading behavior in large blog graphs. In Proceeding of SDM '07, 2007.
15
16
 
17
A. McCallum, A. Corrada-Emmanuel, and X. Wang. Topic and role discovery in social networks. In IJCAI, pages 786--791, 2005.
 
18
G. J. McLachlan and T. Krishnan. The EM Algorithm and Extensions. Wiley, 1997.
19
20
 
21
 
22
M. Richardson and P. Domingos. The intelligent surfer: Probabilistic combination of link and content information in pagerank. In NIPS, pages 1441--1448, 2002.
 
23
 
24
 
25
L. Si and R. Jin. Adjusting mixture weights of gaussian mixture model via regularized probabilistic latent semantic analysis. In PAKDD, pages 622--631, 2005.
26
27
28
 
29
D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Schölkopf. Learning with local and global consistency. In NIPS, 2004.
30
31
 
32
D. Zhou and B. Schölkopf. Discrete regularization. Semi-supervised learning, pages 221--232, 2006.
 
33
X. Zhu, Z. Ghahramani, and J. D. Lafferty. Semi-supervised learning using gaussian fields and harmonic functions. In ICML, pages 912--919, 2003.
34

CITED BY  11

Collaborative Colleagues:
Qiaozhu Mei: colleagues
Deng Cai: colleagues
Duo Zhang: colleagues
ChengXiang Zhai: colleagues