ACM Home Page
Please provide us with feedback. Feedback
Mobile call graphs: beyond power-law and lognormal distributions
Full text MovMov (14:32),  PdfPdf (568 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Las Vegas, Nevada, USA
SESSION: Research papers table of contents
Pages 596-604  
Year of Publication: 2008
ISBN:978-1-60558-193-4
Authors
Mukund Seshadri  Sprint, Burlingame, CA, USA
Sridhar Machiraju  Sprint, Burlingame, CA, USA
Ashwin Sridharan  Sprint, Burlingame, CA, USA
Jean Bolot  Sprint, Burlingame, CA, USA
Christos Faloutsos  Carnegie Mellon University, Pittsburgh, PA, USA
Jure Leskove  Carnegie Mellon University, Pittsburgh, PA, USA
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 19,   Downloads (12 Months): 278,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1401890.1401963
What is a DOI?

ABSTRACT

We analyze a massive social network, gathered from the records of a large mobile phone operator, with more than a million users and tens of millions of calls. We examine the distributions of the number of phone calls per customer; the total talk minutes per customer; and the distinct number of calling partners per customer. We find that these distributions are skewed, and that they significantly deviate from what would be expected by power-law and lognormal distributions.

To analyze our observed distributions (of number of calls, distinct call partners, and total talk time), we propose PowerTrack , a method which fits a lesser known but more suitable distribution, namely the Double Pareto LogNormal (DPLN) distribution, to our data and track its parameters over time. Using PowerTrack , we find that our graph changes over time in a way consistent with a generative process that naturally results in the DPLN distributions we observe. Furthermore, we show that this generative process lends itself to a natural and appealing social wealth interpretation in the context of social networks such as ours. We discuss the application of those results to our model and to forecasting.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
L. A. N. Amaral, A. Scala, M. Barthélémy, and H. E. Stanley. Classes of small-world networks. Proceedings of the National Academy of Sciences, 97(21):11149--11152, 2000.
 
3
A.-L. Barabási and R. Albert. Emergence of scaling in random networks. Science, 286:509--512, 1999.
4
 
5
P. Boldi, B. Codenotti, M. Santini, and S. Vigna. Structural properties of the African Web. In International World Wide Web Conference, New York, NY, 2002. ACM Press.
 
6
 
7
A. Clauset. Power law distributions in empirical data. http://www.santafe.edu/~aaronc/powerlaws/
 
8
A. Clauset, C. R. Shalizi, and M. E. J. Newman. Power-law distributions in empirical data. ArXiv e-print 0706.1062v1, 2007.
 
9
10
 
11
R. Govindan and H. Tangmunarunkit. Heuristics for Internet map discovery. In IEEE INFOCOM, pages 1371--1380, Los Alamitos, CA, March 2000. IEEE Computer Society Press.
 
12
J.-P. Onnela, J. Saramaäki, J. Hyvöven, G. Szabó, M. Argollo de Menezes, K. Kaski, and A.-L. Barabási. Structure and Tie Strengths in Mobile Communication Networks. New Journal of Physics, 9, 2007.
13
 
14
 
15
M. Mitzenmacher. A Brief History of Generative Models for Power Law and Lognormal Distributions. Internet Mathematics, 1(2):226--251.
 
16
M. Mitzenmacher. Dynamic Models for File Sizes and Double Pareto Distributions. Internet Mathematics, 1(3):305--334, 2004.
17
 
18
M. E. J. Newman. Power laws, pareto distributions and Zipf's law. Contemporary Physics, 46:323--351, 2005.
 
19
M. E. J. Newman. Power laws, pareto distributions and Zipf's law. Contemporary Physics, 46:323--351, 2005.
 
20
V. Pareto. Oeuvres Completes. Droz, Geneva, 1896.
 
21
D. M. Pennock, G. W. Flake, S. Lawrence, E. J. Glover, and C. L. Giles. Winners don't take all: Characterizing the competition for links on the Web. Proceedings of the National Academy of Sciences, 99(8):5207--5211, 2002.
 
22
S. Redner. How popular is your paper? an empirical study of the citation distribution. The European Physics Journal B, 4:131--134, 1998.
 
23
W. Reed and M. Jorgensen. The double pareto-lognormal distribution - a new parametric model for size distribution. Communications in Statistics -Theory and Methods, 33(8):1733--1753, 2004.
 
24
R.Gibrat. inégalités économiques. Librarie du Recuil Sirey, 1931.
 
25
G. Szabo and A.-L. Barabasi. Network effects in service usage. ArXiv e-prints physics/0611177, November 2006.


Collaborative Colleagues:
Mukund Seshadri: colleagues
Sridhar Machiraju: colleagues
Ashwin Sridharan: colleagues
Jean Bolot: colleagues
Christos Faloutsos: colleagues
Jure Leskove: colleagues