ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Colibri: fast mining of large static and dynamic graphs
Full text PdfPdf (256 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Las Vegas, Nevada, USA
SESSION: Research papers table of contents
Pages: 686-694  
Year of Publication: 2008
ISBN:978-1-60558-193-4
Authors
Hanghang Tong  Carnegie Mellon University, Pittsburgh, PA, USA
Spiros Papadimitriou  IBM T.J. Watson, Hawthorne, NY, USA
Jimeng Sun  IBM T.J. Watson, Hawthorne, NY, USA
Philip S. Yu  University of Illinois at Chicago, Chicago, IL, USA
Christos Faloutsos  Carnegie Mellon University, Pittsburgh, PA, USA
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 15,   Downloads (12 Months): 160,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1401890.1401973
What is a DOI?

ABSTRACT

Low-rank approximations of the adjacency matrix of a graph are essential in finding patterns (such as communities) and detecting anomalies. Additionally, it is desirable to track the low-rank structure as the graph evolves over time, efficiently and within limited storage. Real graphs typically have thousands or millions of nodes, but are usually very sparse. However, standard decompositions such as SVD do not preserve sparsity. This has led to the development of methods such as CUR and CMD, which seek a non-orthogonal basis by sampling the columns and/or rows of the sparse matrix.

However, these approaches will typically produce overcomplete bases, which wastes both space and time. In this paper we propose the family of Colibri methods to deal with these challenges. Our version for static graphs, Colibri-S, iteratively finds a non-redundant basis and we prove that it has no loss of accuracy compared to the best competitors (CUR and CMD), while achieving significant savings in space and time: on real data, Colibri-S requires much less space and is orders of magnitude faster (in proportion to the square of the number of non-redundant columns). Additionally, we propose an efficient update algorithm for dynamic, time-evolving graphs, Colibri-D. Our evaluation on a large, real network traffic dataset shows that Colibri-D is over 100 times faster than the best published competitor (CMD).


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
R. Albert, H. Jeong, and A.-L. Barabasi. Diameter of the world wide web. Nature, (401):130--131, 1999.
3
 
4
5
 
6
S. Dorogovtsev and J. Mendes. Evolution of networks. Advances in Physics, 51:1079--1187, 2002.
 
7
 
8
 
9
P. Drineas, M. W. Mahoney, and S. Muthukrishnan. Relative-error cur matrix decompositions. CoRR, abs/0708.3696, 2007.
10
 
11
12
 
13
M. Girvan and M. E. J. Newman. Community structure is social and biological networks.
 
14
G. H. Golub and C. F. Van-Loan. Matrix Computations. The Johns Hopkins University Press, Baltimore, 2nd edition, 1989.
15
 
16
17
18
19
 
20
M. E. J. Newman. The structure and function of complex networks. SIAM Review, 45:167--256, 2003.
21
22
 
23
 
24
25
 
26
27
 
28
J. Sun, Y. Xie, H. Zhang, and C. Faloutsos. Less is more: Compact matrix decomposition for large sparse graphs. In SDM, 2007.
 
29
 
30


Collaborative Colleagues:
Hanghang Tong: colleagues
Spiros Papadimitriou: colleagues
Jimeng Sun: colleagues
Philip S. Yu: colleagues
Christos Faloutsos: colleagues