|
ABSTRACT
In this paper we study asymmetric proximity measures on directed graphs, which quantify the relationships between two nodes or two groups of nodes. The measures are useful in several graph mining tasks, including clustering, link prediction and connection subgraph discovery. Our proximity measure is based on the conceptof escape probability. This way, we strive to summarize the multiple facets of nodes-proximity, while avoiding some of the pitfalls to which alternative proximity measures are susceptible. A unique feature of the measures is accounting for the underlying directional information. We put a special emphasis on computational efficiency, and develop fast solutions that are applicable in several settings. Our experimental study shows the usefulness of our proposed direction-aware proximity method for several applications, and that our algorithms achieve a significant speedup (up to 50,000x) over straight forward implementations.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
B. Aditya , Gaurav Bhalotia , Soumen Chakrabarti , Arvind Hulgeri , Charuta Nakhe , Parag Parag , S. Sudarshan, BANKS: browsing and keyword searching in relational databases, Proceedings of the 28th international conference on Very Large Data Bases, p.1083-1086, August 20-23, 2002, Hong Kong, China
|
| |
2
|
|
| |
3
|
|
 |
4
|
|
| |
5
|
P. Doyle and J. Snell. Random walks and electric networks, volume 22. Mathematical Association America, New York, 1984.
|
 |
6
|
|
 |
7
|
|
| |
8
|
J. Friedman. Greedy function approximation: A gradient boosting machine. In Annual of Statistics, 2001.
|
| |
9
|
|
| |
10
|
G. Golub and C. Loan. Matrix Computation. Johns Hopkins, 1996.
|
| |
11
|
M. Grötschel, C. L. Monma, and M. Stoer. Design of survivable networks. In Handbooks in Operations Research and Management Science 7: Network Models. North Holland, 1993.
|
| |
12
|
T. Hastie, R. Tibshirani, and J. Friedman. The Elements of Statistical Learning. Springer.
|
 |
13
|
|
 |
14
|
Jingrui He , Mingjing Li , Hong-Jiang Zhang , Hanghang Tong , Changshui Zhang, Manifold-ranking based image retrieval, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
[doi> 10.1145/1027527.1027531]
|
 |
15
|
|
 |
16
|
|
 |
17
|
Jia-Yu Pan , Hyung-Jeong Yang , Christos Faloutsos , Pinar Duygulu, Automatic multimedia cross-modal correlation discovery, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, 2004, Seattle, WA, USA
[doi> 10.1145/1014052.1014135]
|
| |
18
|
|
 |
19
|
|
CITED BY 5
|
|
Luh Yen , Marco Saerens , Amin Mantrach , Masashi Shimbo, A family of dissimilarity measures between nodes generalizing both the shortest-path and the commute-time distances, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
|
|
|
|
|
|
|
|
|
Hanghang Tong , Yasushi Sakurai , Tina Eliassi-Rad , Christos Faloutsos, Fast mining of complex time-stamped events, Proceeding of the 17th ACM conference on Information and knowledge management, October 26-30, 2008, Napa Valley, California, USA
|
|
|
|
|