|
ABSTRACT
We study the dynamics of information propagation in environments of low-overhead personal publishing, using a large collection of weblogs over time as our example domain. We characterize and model this collection at two levels. First, we present a macroscopic characterization of topic propagation through our corpus, formalizing the notion of long-running "chatter" topics consisting recursively of "spike" topics generated by outside world events, or more rarely, by resonances within the community. Second, we present a microscopic characterization of propagation from individual to individual, drawing on the theory of infectious diseases to model the flow. We propose, validate, and employ an algorithm to induce the underlying propagation network from a sequence of posts, and report on the results.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Reka Albert, Hawoong Jeong, and Albert-Laszlo Barabasi. Error and attack tolerance of complex networks. Nature, 406, July 2000.
|
| |
3
|
|
| |
4
|
Norman Bailey. The Mathematical Theory of Infectious Diseases and its Applications. Griffin, London, 2nd edition, 1975.
|
| |
5
|
Venkatesh Bala and Sanjeev Goyal. A strategic analysis of network reliability. Review of Economic Design, 5:205--228, 2000.
|
| |
6
|
Bela Bollabas and Oliver Riordan. Robustness and vulnerability of scale-free random graphs. Internet Mathematics, 1(1), 2003.
|
| |
7
|
Daypop. http://www.daypop.com.
|
| |
8
|
A. Dempster, N. Laird, and D. Rubin. Maximum likelihood from incomplete data via the EM algorithm. J. Royal Stat. Soc., 1977.
|
| |
9
|
Jared Diamond. Guns, Germs, and Steel. Random House, 1997.
|
| |
10
|
Victor M. Eguiluz and Konstantin Klemm. Epidemic threshold in structured scale-free networks. Physical Review Letters, 89, 2002. cond-mat/0205439.
|
| |
11
|
Michelle Girvan, Duncan S. Callaway, M. E. J. Newman, and Steven H. Strogatz. A simple model of epidemics with pathogen mutation. Phys. Rev. E, 65(031915), 2002. nlin.CD/0105044.
|
| |
12
|
Malcolm Gladwell. The Tipping Point: How little things can make a big difference. Little Brown & Co., 2000.
|
| |
13
|
Jacob Goldenberg, Barak Libai, and Eitan Muller. Talk of the network: A complex systems look at the underlying process of word-of-mouth. Marketing Letters, 12(3):211--223, 2001.
|
| |
14
|
Mark Granovetter. Threshold models of collective behavior. American Journal of Sociology, 83(6):1420--1443, 1987.
|
| |
15
|
R. V. Guha and Rob McCool. TAP: A system for integrating web services into a global knowledge base.
|
| |
16
|
Hans Haller and Sudipta Sarangi. Nash networks with heterogeneous agents. Working Paper Series E-2001-1, Virginia Tech, 2003.
|
 |
17
|
|
| |
18
|
Andrew King. The evolution of RSS. http://www.webreference.com/authoring/languages/xml/rss/1/.
|
 |
19
|
|
| |
20
|
M. Mitzenmacher. A brief history of lognormal and power law distributions. In Allerton Comm. Control Comput., 2001.
|
| |
21
|
Cristopher Moore and M. E. J. Newman. Epidemics and percolation in small-world networks. Phys. Rev. E, 61:5678--5682, 2000. cond-mat/9911492.
|
| |
22
|
Stephen Morris. Contagion. Review of Economic Studies, 67, 2000.
|
| |
23
|
M. E. J. Newman. The spread of epidemic disease on networks. Phys. Rev. E, 66(016128), 2002. cond-mat/0205009.
|
| |
24
|
M. E. J. Newman, Stephanie Forrest, and Justin Balthrop. Email networks and the spread of computer viruses. Phys. Rev. E, 66(035101), 2002.
|
| |
25
|
Romauldo Pasto-Satorras and Alessandro Vespignani. Epidemic spreading in scale-free networks. Phys. Rev. Letters, 86(14):3200--3203, April 2001.
|
| |
26
|
|
| |
27
|
Topic Detection and Tracking (TDT-2003). http://www.nist.gov/TDT.
|
| |
28
|
D. Watts and S. Strogatz. Collective dynamics of 'small-world' networks. Nature, 393:440--442, 1998.
|
| |
29
|
WebFountain. http://www.almaden.ibm.com/WebFountain/.
|
| |
30
|
Fang Wu, Bernardo A. Huberman, Lada A. Adamic, and Joshua R. Tyler. Information flow in social groups. Manuscript, 2003.
|
| |
31
|
H. Peyton Young. The diffusion of innovation in social networks. Sante Fe Institute Working Paper 02-04-018, 2002.
|
CITED BY 59
|
|
|
|
|
Daniel Gruhl , R. Guha , Ravi Kumar , Jasmine Novak , Andrew Tomkins, The predictive power of online chatter, Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, August 21-24, 2005, Chicago, Illinois, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Daniel Gruhl , Daniel N. Meredith , Jan H. Pieper , Alex Cozzi , Stephen Dill, The web beyond popularity: a really simple system for web scale RSS, Proceedings of the 15th international conference on World Wide Web, May 23-26, 2006, Edinburgh, Scotland
|
|
|
|
|
|
|
|
|
Xiaodan Song , Belle L. Tseng , Ching-Yung Lin , Ming-Ting Sun, Personalized recommendation driven by information flow, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, August 06-11, 2006, Seattle, Washington, USA
|
|
|
Jure Leskovec , Lada A. Adamic , Bernardo A. Huberman, The dynamics of viral marketing, Proceedings of the 7th ACM conference on Electronic commerce, p.228-237, June 11-15, 2006, Ann Arbor, Michigan, USA
|
|
|
|
|
|
|
|
|
Ding Zhou , Xiang Ji , Hongyuan Zha , C. Lee Giles, Topic evolution and social interactions: how authors effect research, Proceedings of the 15th ACM international conference on Information and knowledge management, November 06-11, 2006, Arlington, Virginia, USA
|
|
|
|
|
|
|
|
|
|
|
|
Xiaodan Song , Yun Chi , Koji Hino , Belle L. Tseng, Information flow modeling based on diffusion rate for prediction and ranking, Proceedings of the 16th international conference on World Wide Web, May 08-12, 2007, Banff, Alberta, Canada
|
|
|
|
|
|
Jure Leskovec , Andreas Krause , Carlos Guestrin , Christos Faloutsos , Jeanne VanBriesen , Natalie Glance, Cost-effective outbreak detection in networks, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, August 12-15, 2007, San Jose, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
Qiaozhu Mei , Xu Ling , Matthew Wondra , Hang Su , ChengXiang Zhai, Topic sentiment mixture: modeling facets and opinions in weblogs, Proceedings of the 16th international conference on World Wide Web, May 08-12, 2007, Banff, Alberta, Canada
|
|
|
Xiaochuan Ni , Gui-Rong Xue , Xiao Ling , Yong Yu , Qiang Yang, Exploring in the weblog space by detecting informative and affective articles, Proceedings of the 16th international conference on World Wide Web, May 08-12, 2007, Banff, Alberta, Canada
|
|
|
|
|
|
Josiane Xavier Parreira , Sebastian Michel , Matthias Bender , Tom Crecelius , Gerhard Weikum, P2P authority analysis for social communities, Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
|
|
Jon M. Kleinberg, Challenges in mining social network data: processes, privacy, and paradoxes, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, p.4-5, August 12-15, 2007, San Jose, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
Candida Tauro , Sameer Ahuja , Manuel A. Pérez-Quiñones , Andrea Kavanaugh , Philip Isenhour, Deliberation in the wild: a visualization tool for blog discovery and citizen-to-citizen participation, Proceedings of the 2008 international conference on Digital government research, May 18-21, 2008, Montreal, Canada
|
|
|
|
|
|
|
|
|
|
|
|
Munmun De Choudhury , Hari Sundaram , Ajita John , Dorée Seligmann, Dynamic prediction of communication flow using social context, Proceedings of the nineteenth ACM conference on Hypertext and hypermedia, June 19-21, 2008, Pittsburgh, PA, USA
|
|
|
Avaré Stewart , Ling Chen , Raluca Paiu , Wolfgang Nejdl, Discovering information diffusion paths from blogosphere for online advertising, Proceedings of the 1st international workshop on Data mining and audience intelligence for advertising, p.46-54, August 12-12, 2007, San Jose, California
|
|
|
|
|
|
|
|
|
Mary Helander , Rick Lawrence , Yan Liu , Claudia Perlich , Chandan Reddy , Saharon Rosset, Looking for great ideas: analyzing the innovation jam, Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis, p.66-73, August 12-12, 2007, San Jose, California
|
|
|
Yun Chi , Shenghuo Zhu , Xiaodan Song , Junichi Tatemura , Belle L. Tseng, Structural and temporal analysis of the blogosphere through community factorization, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, August 12-15, 2007, San Jose, California, USA
|
|
|
|
|
|
|
|
|
A. Scherrer , P. Borgnat , E. Fleury , J. -L. Guillaume , C. Robardet, Description and simulation of dynamic mobility networks, Computer Networks: The International Journal of Computer and Telecommunications Networking, v.52 n.15, p.2842-2858, October, 2008
|
|
|
Xu Ling , Qiaozhu Mei , ChengXiang Zhai , Bruce Schatz, Mining multi-faceted overviews of arbitrary topics in a text collection, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
|
|
|
Dong Zhou , Mark Truran , Tim Brailsford , Helen Ashman , Amir Pourabdollah, Llama-b: automatic hyperlink authoring in the blogosphere, Proceedings of the nineteenth ACM conference on Hypertext and hypermedia, June 19-21, 2008, Pittsburgh, PA, USA
|
|
|
|
|
|
Ka Cheung Sia , Junghoo Cho , Yun Chi , Belle L. Tseng, Efficient computation of personal aggregate queries on blogs, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Lei Guo , Enhua Tan , Songqing Chen , Xiaodong Zhang , Yihong (Eric) Zhao, Analyzing patterns of user content generation in online social networks, Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, June 28-July 01, 2009, Paris, France
|
|
|
|
|
|
|
|
|
Masahiro Kimura , Kazumi Saito , Ryohei Nakano, Extracting influential nodes for information diffusion on a social network, Proceedings of the 22nd national conference on Artificial intelligence, p.1371-1376, July 22-26, 2007, Vancouver, British Columbia, Canada
|
|