|
ABSTRACT
Separation of connected components from a graph with disconnected graph components mostly use breadth-first search (BFS) or depth-first search (DFS) graph algorithms. Here we propose a new algebraic method to separate disconnected and nearly-disconnected components. This method is based on spectral graph partitioning, following a key observation that disconnected components will show up, after properly sorted, as step-function like curve in the lowest eigenvectors of the Laplacian matrix of the graph. Following an perturbative analysis framework, we systematically analyzed the graph structures, first on the disconnected subgraph case, and second on the effects of adding edges sparsely connecting different subgraphs as a perturbation. Several new results are derived, providing insights to spectral methods and related clustering objective function. Examples are given illustrating the concepts and results our methods. Comparing to the standard graph algorithms, this method has the same O(‖E ‖ + ‖V‖log(‖V‖)) complexity, but is easier to implement (using readily available eigensolvers). Further more the method can easily identify articulation points and bridges on nearly-disconnected graphs. Segmentation of a real example of Web graph for query amazon is given. We found that each disconnected or nearly-disconnected components forms a cluster on a clear topic.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
|
| |
3
|
N. Biggs. Algebraic Graph Theory. Cambridge Univ. Press, 1974.
|
| |
4
|
Andrei Broder , Ravi Kumar , Farzin Maghoul , Prabhakar Raghavan , Sridhar Rajagopalan , Raymie Stata , Andrew Tomkins , Janet Wiener, Graph structure in the Web, Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking, p.309-320, June 2000, Amsterdam, The Netherlands
|
| |
5
|
S. Chakrabarti, B. E. Dom, and J. M. Kleinberg. Mining the link structure of the world wide web. Feb 1999.
|
| |
6
|
C.-K. Cheng and Y.A. Wei. An inlproved two-way partitioning algorithm with stable performance. IEEE. Trans. on Computed Aided Desgin , 10:1502-1511, 1991.
|
| |
7
|
F.R.K. Chung. Spectral Graph Theory. Amer. Math. Society., 1997.
|
| |
8
|
|
| |
9
|
C. Ding, X. He, H. Zha, M. Gu, and H. Simon. Spectral min-max cut for graph partitioning and data clustering. Lawrence Berkeley Nat'l Lab Tech report 47848, May 2001.
|
| |
10
|
W.E. Donath and A. J. Hoffman. Lower bounds for partitioning of graphs. IBM J. Res. Develop., 17:420-425, 1973.
|
| |
11
|
M. Fiedler. Algebraic connectivity of graphs. Czech. Math. l., 23:298-305, 1973.
|
| |
12
|
M. Fiedler. A property of eigenvectors of non-negative symmetric matrices and its application to graph theory. Czech. Math. J., 25:619-633, 1975.
|
 |
13
|
Gary William Flake , Steve Lawrence , C. Lee Giles, Efficient identification of Web communities, Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, p.150-160, August 20-23, 2000, Boston, Massachusetts, United States
[doi> 10.1145/347090.347121]
|
| |
14
|
|
 |
15
|
|
| |
16
|
L. Hagen and A.B. Kalmg. New spectral methods for ratio cut partitioning and clustering. IEEE. Trans. on Computed Aided Desgin, 11:1074-1085, 1992.
|
| |
17
|
X. He, H. Zha, C. Ding, and H.D. Simon. Web document clustering using hyperlink structures. Tech Report CSE-01-O06, April 2001.
|
 |
18
|
|
| |
19
|
R. R. Larson. Bibliometrics of the world wide web: an exploratory analysis of the intellectual structures of cyberspace. Proc. S1GIR'96, 1996.
|
| |
20
|
|
| |
21
|
J. Mathews and R.L. Walker. Mathematical Methods of Physics. Addison-Wesley, 1971.
|
| |
22
|
L. Page. Pagerank: bring order to the web. Stanford Digital Library working paper 199%0072, 1997.
|
| |
23
|
|
 |
24
|
Peter Pirolli , James Pitkow , Ramana Rao, Silk from a sow's ear: extracting usable structures from the Web, Proceedings of the SIGCHI conference on Human factors in computing systems: common ground, p.118-125, April 13-18, 1996, Vancouver, British Columbia, Canada
[doi> 10.1145/238386.238450]
|
| |
25
|
|
| |
26
|
L.I. Schiff. Quantum Mechanics, 3rd ed. McGraw-Hill, 1968.
|
| |
27
|
|
| |
28
|
Y. Shiloach and U. Vishkin. An o(log n) parallel connectivity algorithm. J. Algorithm, pages 57-67, 1982.
|
| |
29
|
R. E. Tarjan. Depth-first search and linear graph algorithms. SIAM J. Computing., J:146-160, 1972.
|
CITED BY 6
|
|
|
|
|
|
|
|
|
|
|
Miklos Kurucz , Andras Benczur , Karoly Csalogany , Laszlo Lukacs, Spectral clustering in telephone call graphs, Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis, p.82-91, August 12-12, 2007, San Jose, California
|
|
|
|
|
|
|
|