|
ABSTRACT
Large and complex graphs representing relationships among sets of entities are an increasingly common focus of interest in data analysis---examples include social networks, Web graphs, telecommunication networks, and biological networks. In interactive analysis of such data a natural query is "which entities are most important in the network relative to a particular individual or set of individuals?" We investigate the problem of answering such queries in this paper, focusing in particular on defining and computing the importance of nodes in a graph relative to one or more root nodes. We define a general framework and a number of different algorithms, building on ideas from social networks, graph theory, Markov models, and Web graph analysis. We experimentally evaluate the different properties of these algorithms on toy graphs and demonstrate how our approach can be used to study relative importance in real-world networks including a network of interactions among September 11th terrorists, a network of collaborative research in biotechnology among companies and universities, and a network of co-authorship relationships among computer science researchers.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Allan Borodin , Gareth O. Roberts , Jeffrey S. Rosenthal , Panayiotis Tsaparas, Finding authorities and hubs from link structures on the World Wide Web, Proceedings of the 10th international conference on World Wide Web, p.415-429, May 01-05, 2001, Hong Kong, Hong Kong
[doi> 10.1145/371920.372096]
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
| |
5
|
Freeman, L. (1979) Centrality in social networks: I. conceptual clarification. Social Networks, 1, pp. 215--239.
|
 |
6
|
|
| |
7
|
Hoff, P. D., Raftery, A. E. Handcock, M. S. (2002) Latent space approaches to social network analysis. Journal of The American Statistical Association, 97, pp. 1090--1098.
|
| |
8
|
Jeh, G. and Widom J. (2002) Scaling personalized Web search. Stanford University, Computer Science Department Technical Report.
|
| |
9
|
Katz, L. (1953) A new status index derived from sociometric analysis. Psychometrika, 18, pp. 39--43.
|
| |
10
|
Kemeny, J. and Snell, J. (1976) Finite Markov Chains. Springer Verlag.
|
 |
11
|
|
| |
12
|
Krebs, V. E. (2001) Mapping networks of terrorist cells. Connections, 24(3), pp. 43--52.
|
| |
13
|
|
| |
14
|
|
| |
15
|
Page, L., Brin, S., Motwani, R., Winograd, T. (1998) The PageRank citation ranking: bringing order to the web. Stanford University, Computer Science Department Technical Report.
|
| |
16
|
Powell, W. W., White, D. R., Koput, K. W., and Owen-Smith, J. (2002) Network dynamics and field evolution: the growth of interorganizational collaboration in the life sciences. Submitted.
|
| |
17
|
Stephenson, K., and Zelen, M. (1989) Rethinking centrality: methods and examples. Social Networks, 11, pp. 1--37.
|
| |
18
|
Wasserman, S., and Faust, K. (1994) Social Network Analysis: Methods and Applications. Cambridge: Cambridge University Press.
|
CITED BY 24
|
|
|
|
|
Mark Steyvers , Padhraic Smyth , Michal Rosen-Zvi , Thomas Griffiths, Probabilistic author-topic models for information discovery, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, 2004, Seattle, WA, USA
|
|
|
Takahiko Ito , Masashi Shimbo , Taku Kudo , Yuji Matsumoto, Application of kernels to link analysis, Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, August 21-24, 2005, Chicago, Illinois, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ding Zhou , Eren Manavoglu , Jia Li , C. Lee Giles , Hongyuan Zha, Probabilistic models for discovering e-communities, Proceedings of the 15th international conference on World Wide Web, May 23-26, 2006, Edinburgh, Scotland
|
|
|
|
|
|
|
|
|
Xiaodan Song , Yun Chi , Koji Hino , Belle L. Tseng, Information flow modeling based on diffusion rate for prediction and ranking, Proceedings of the 16th international conference on World Wide Web, May 08-12, 2007, Banff, Alberta, Canada
|
|
|
|
|
|
Luh Yen , Marco Saerens , Amin Mantrach , Masashi Shimbo, A family of dissimilarity measures between nodes generalizing both the shortest-path and the commute-time distances, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, August 24-27, 2008, Las Vegas, Nevada, USA
|
|
|
|
|
|
|
|
|
Jyh-Ren Shieh , Yang-Ting Yeh , Chih-Hung Lin , Ching-Yung Lin , Ja-Ling Wu, Collaborative knowledge semantic graph image search, Proceeding of the 17th international conference on World Wide Web, April 21-25, 2008, Beijing, China
|
|
|
|
|
|
|
|
|
Luh Yen , Francois Fouss , Christine Decaestecker , Pascal Francq , Marco Saerens, Graph nodes clustering with the sigmoid commute-time kernel: A comparative study, Data & Knowledge Engineering, v.68 n.3, p.338-361, March, 2009
|
|
|
|
|
|
|
|
|
|
|
|
Foster Provost , Brian Dalessandro , Rod Hook , Xiaohan Zhang , Alan Murray, Audience selection for on-line brand advertising: privacy-friendly social network targeting, Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, June 28-July 01, 2009, Paris, France
|
|