ACM Home Page
Please provide us with feedback. Feedback
Weighted graphs and disconnected components: patterns and a generator
Full text MovMov (24:42),  PdfPdf (610 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Las Vegas, Nevada, USA
SESSION: Research papers table of contents
Pages 524-532  
Year of Publication: 2008
ISBN:978-1-60558-193-4
Authors
Mary McGlohon  Carnegie Mellon University, Pittsburgh, PA, USA
Leman Akoglu  Carnegie Mellon University, Pittsburgh, PA, USA
Christos Faloutsos  Carnegie Mellon University, Pittsburgh, PA, USA
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 22,   Downloads (12 Months): 357,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1401890.1401955
What is a DOI?

ABSTRACT

The vast majority of earlier work has focused on graphs which are both connected (typically by ignoring all but the giant connected component), and unweighted. Here we study numerous, real, weighted graphs, and report surprising discoveries on the way in which new nodes join and form links in a social network. The motivating questions were the following: How do connected components in a graph form and change over time? What happens after new nodes join a network -- how common are repeated edges? We study numerous diverse, real graphs (citation networks, networks in social media, internet traffic, and others); and make the following contributions: (a) we observe that the non-giant connected components seem to stabilize in size, (b) we observe the weights on the edges follow several power laws with surprising exponents, and (c) we propose an intuitive, generative model for graph growth that obeys observed patterns.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
R. Albert and A.-L. Barabasi. Statistical mechanics of complex networks. Reviews of Modern Physics, 74:47, 2002.
 
2
R. Albert, H. Jeong, and A.-L. Barabasi. Diameter of the world wide web. Nature, (401):130--131, 1999.
 
3
A. L. Barabasi and R. Albert. Emergence of scaling in random networks. Science, 286(5439):509--512, October 1999.
4
5
 
6
G. Casella and R. L. Berger. Statistical Inference. Duxbury, 2002.
7
 
8
9
 
10
P. Erdos and A. Renyi. On the evolution of random graphs. Publ. Math. Inst. Hungary. Acad. Sci., 5:17--61, 1960.
 
11
A. Fabrikant, E. Koutsoupias, and C. H. Papadimitriou. Heuristically optimized trade-offs: A new paradigm for power laws in the internet (extended abstract), 2002.
12
13
14
 
15
A. Lazarevic, L. Ert¨oz, V. Kumar, A. Ozgur, and J. Srivastava. A comparative study of anomaly detection schemes in network intrusion detection. In Proceedings of the Third SIAM International Conference on Data Mining, 2003.
 
16
J. Leskovec, D. Chakrabarti, J. M. Kleinberg, and C. Faloutsos. Realistic, mathematically tractable graph generation and evolution, using kronecker multiplication. PKDD, pages 133--145, 2005.
17
 
18
J. Leskovec, K. Lang, A. Dasgupta, and M. Mahoney. Community structure in real graphs: The 'negative dimensionality' paradox. In International World Wide Web Conference, 2008.
 
19
J. Leskovec, M. McGlohon, C. Faloutsos, N. Glance, and M. Hurst. Cascading behavior in large blog graphs: Patterns and a model. In Society of Applied and Industrial Mathematics: Data Mining, 2007.
 
20
S. Milgram. The small-world problem. Psychology Today, 2:60--67, 1967.
 
21
M. E. J. Newman. Power laws, pareto distributions and zipf's law, December 2004.
22
23
 
24
S.-T. Park, D. M. Pennock, and C. L. Giles. Comparing static and dynamic measurements and models of the internet's as topology. In INFOCOM, 2004.
 
25
D. M. Pennock, G. W. Flake, S. Lawrence, E. J. Glover, and C. L. Giles. Winners don't take all: Characterizing the competition for links on the web. Proceedings of the National Academy of Sciences, 99(8):5207--5211, 2002.
 
26
S. Redner. Citation statistics from more than a century of physical review, Oct 2004.
 
27
M. Richardson and P. Domingos. Mining knowledge-sharing sites for viral marketing, 2002.
 
28
M. Schroeder. Fractals, Chaos, Power Laws: Minutes from an Infinite Paradise. W.H. Freeman and Company, New York, 1991.
 
29
M. Wang, T. Madhyastha, N. H. Chang, S. Papadimitriou, and C. Faloutsos. Data mining meets performance evaluation: Fast algorithms for modeling bursty traffic. ICDE, Feb. 2002.
 
30
D. J. Watts and S. H. Strogatz. Collective dynamics of 'small-world' networks. Nature, (393):440--442, 1998.


Collaborative Colleagues:
Mary McGlohon: colleagues
Leman Akoglu: colleagues
Christos Faloutsos: colleagues