|
ABSTRACT
Statistical machine learning techniques for data classification usually assume that all entities are i.i.d. (independent and identically distributed). However, real-world entities often interconnect with each other through explicit or implicit relationships to form a complex network. Although some graph-based classification methods have emerged in recent years, they are not really suitable for complex networks as they do not take the degree distribution of network into consideration. In this paper, we propose a new technique, Modularity Kernel, that can effectively exploit the latent community structure of networked entities for their classification. A number of experiments on hypertext datasets show that our proposed approach leads to excellent classification performance in comparison with the state-of-the-art methods.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
A. Argyriou, M. Herbster, and M. Pontil. Combining graph laplacians for semi-supervised learning. In Advances in Neural Information Processing Systems (NIPS), volume 18, pages 67?-74, Vancouver, Canada, 2005.
|
| |
3
|
M. Belkin, I. Matveeva, and P. Niyogi. Regularization and semi-supervised learning on large graphs. In Proceedings of the 17th Annual Conference on Learning Theory (COLT), pages 624--638, Banff, Canada, 2004.
|
| |
4
|
|
| |
5
|
|
| |
6
|
|
 |
7
|
Avrim Blum , John Lafferty , Mugizi Robert Rwebangira , Rajashekar Reddy, Semi-supervised learning using randomized mincuts, Proceedings of the twenty-first international conference on Machine learning, p.13, July 04-08, 2004, Banff, Alberta, Canada
[doi> 10.1145/1015330.1015429]
|
 |
8
|
|
| |
9
|
S. Boccaletti, V. Latora, Y. Moreno, M. Chavez, and D.-U. Hwang. Complex networks : Structure and dynamics. Physics Reports, 424(4-5):175--308, 2006.
|
| |
10
|
|
| |
11
|
|
 |
12
|
Soumen Chakrabarti , Byron Dom , Piotr Indyk, Enhanced hypertext categorization using hyperlinks, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.307-318, June 01-04, 1998, Seattle, Washington, United States
|
| |
13
|
O. Chapelle, B. Scholkopf, and A. Zien, editors. Semi-Supervised Learning. MIT Press, 2005.
|
| |
14
|
O. Chapelle, J. Weston, and B. Scholkopf. Cluster kernels for semi-supervised learning. In Advances in Neural Information Processing Systems (NIPS), volume 15, pages 585--592, Vancouver, Canada, 2003.
|
| |
15
|
F. R. K. Chung. Spectral Graph Theory. American Mathematical Society, 1997.
|
| |
16
|
|
| |
17
|
|
| |
18
|
D. A. Cohn and T. Hofmann. The missing link - a probabilistic model of document content and hypertext connectivity. In Advances in Neural Information Processing Systems (NIPS), pages 430--436, Denver, CO, USA, 2000.
|
| |
19
|
|
| |
20
|
L. Danon, J. Duch, A. Diaz-Guilera, and A. Arenas. Comparing community structure identification. Journal of Statistical Mechanics, page P09008, 2005.
|
| |
21
|
|
| |
22
|
G. H. Golub and C. F. V. Loan. Matrix Computations. The Johns Hopkins University Press, 3rd edition, 1996.
|
| |
23
|
M. Gustafsson, A. Lombardi, and M. Hornquist. Comparison and validation of community structures in complex networks. Physica A, 367:559--576, 2006.
|
| |
24
|
Z. Gyöngyi, H. Garcia-Molina, and J. Pedersen. Web content categorization using link information. Technical report, Stanford University, 2007.
|
 |
25
|
|
 |
26
|
|
 |
27
|
|
| |
28
|
T. Joachims. Transductive learning via spectral graph partitioning. In Proceedings of the 20th International Conference on Machine Learning (ICML), pages 290--297, Washington, DC, 2003.
|
| |
29
|
|
 |
30
|
|
| |
31
|
|
| |
32
|
|
| |
33
|
|
| |
34
|
M. E. J. Newman. The structure and function of complex networks. SIAM Review, 45:167--256, 2003.
|
| |
35
|
M. E. J. Newman. Detecting community structure in networks. The European Physical Journal B, 38:321--330, 2004.
|
| |
36
|
M. E. J. Newman. Finding community structure in networks using the eigenvectors of matrices. Physical Review E, 74:036104, 2006.
|
| |
37
|
M. E. J. Newman. Modularity and community structure in networks. Proceedings of the National Academy of Sciences USA, 103:8577--8582, 2006.
|
| |
38
|
B. Scholkopf and A. J. Smola. Learning with Kernels. MIT Press, Cambridge, MA, 2002.
|
| |
39
|
|
| |
40
|
|
| |
41
|
V. Sindhwani, M. Belkin, and P. Niyogi. The geometric basis of semi-supervised learning. In O. Chapelle, B. Scholkopf, and A. Zien, editors, Semi-Supervised Learning, chapter 12, pages 209--226. MIT Press, 2006.
|
 |
42
|
|
| |
43
|
|
| |
44
|
M. Szummer and T. Jaakkola. Partially labeled classification with markov random walks. In Advances in Neural Information Processing Systems (NIPS), pages 945--952, Vancouver, Canada, 2001.
|
| |
45
|
B. Taskar, P. Abbeel, and D. Koller. Discriminative probabilistic models for relational data. In Proceedings of the 18th Conference in Uncertainty in Artificial Intelligence (UAI), pages 485--492, Edmonton, Alberta, Canada, 2002.
|
| |
46
|
|
| |
47
|
Q. Yang and X. Wu. 10 challenging problems in data mining research. International Journal of Information Technology and Decision Making, 5(4):597--604, 2006.
|
| |
48
|
|
 |
49
|
|
| |
50
|
D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Scholkopf. Learning with local and global consistency. In Advances in Neural Information Processing Systems (NIPS), Vancouver and Whistler, Canada, 2003.
|
 |
51
|
|
| |
52
|
D. Zhou, B. Scholkopf, and T. Hofmann. Semi-supervised learning on directed graphs. In Advances in Neural Information Processing Systems (NIPS), Vancouver, Canada, 2004.
|
 |
53
|
|
| |
54
|
X. Zhu. Semi-supervised learning literature survey. Technical Report 1530, Computer Sciences, University of Wisconsin-Madison, 2005. http://www.cs.wisc.edu/ ~jerryzhu/pub/ssl_survey.pdf.
|
| |
55
|
X. Zhu, Z. Ghahraman, and J. D. Lafferty. Semi-supervised learning using gaussian fields and harmonic functions. In Proceedings of the 20th International Conference on Machine Learning (ICML), pages 912--919, Washington, DC, 2003.
|
|