|
ABSTRACT
In this paper, we propose a new way to automatically model and predict human behavior of receiving and disseminating information by analyzing the contact and content of personal communications. A personal profile, called CommunityNet, is established for each individual based on a novel algorithm incorporating contact, content, and time information simultaneously. It can be used for personal social capital management. Clusters of CommunityNets provide a view of informal networks for organization management. Our new algorithm is developed based on the combination of dynamic algorithms in the social network field and the semantic content classification methods in the natural language processing and machine learning literatures. We tested CommunityNets on the Enron Email corpus and report experimental results including filtering, prediction, and recommendation capabilities. We show that the personal behavior and intention are somewhat predictable based on these models. For instance, "to whom a person is going to send a specific email" can be predicted by one's personal social network and content analysis. Experimental results show the prediction accuracy of the proposed adaptive algorithm is 58% better than the social network-based predictions, and is 75% better than an aggregated model based on Latent Dirichlet Allocation with social network enhancement. Two online demo systems we developed that allow interactive exploration of CommunityNet are also discussed.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
B. A. Nardi, S. Whittaker, and H. Schwarz. "It's not what you know, it's who you know: work in the information age," First Mon., 5, 2000.
|
| |
2
|
D. Krackhardt, "Panel on Informal Networks within Formal Organizations," XXV Intl. Social Network Conf., Feb. 2005.
|
| |
3
|
D. Krackhardt and M. Kilduff, "Structure, culture and Simmelian ties in entrepreneurial firms," Social Networks, Vol. 24, 2002.
|
 |
4
|
Bonnie A. Nardi , Steve Whittaker , Ellen Isaacs , Mike Creech , Jeff Johnson , John Hainsworth, Integrating communication and information through ContactMap, Communications of the ACM, v.45 n.4, p.89-95, April 2002
[doi> 10.1145/505248.505251]
|
| |
5
|
https://www.linkedin.com/home?trk=logo.
|
| |
6
|
https://www.orkut.com/Login.aspx.
|
| |
7
|
|
| |
8
|
N. Lin, "Social Capital," Cambridge Univ. Press, 2001.
|
| |
9
|
W. Cohen. http://www-2.cs.cmu.edu/~enron/.
|
| |
10
|
S. Milgram. "The Small World Problem," Psychology Today, pp 60--67, May 1967.
|
 |
11
|
|
 |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
L. Page, S. Brin, R. Motwani and T. Winograd. "The PageRank Citation Ranking: Bringing Order to the Web," Stanford Digital Libraries Working Paper, 1998.
|
| |
16
|
|
| |
17
|
S. Wasserman, and P. E. Pattison, "Logit models and logistic regression for social networks: I. An introduction to Markov graphs and p*", Psychometrika, 61: 401-- 425, 1996.
|
| |
18
|
T. A.B. Snijders. "Models for Longitudinal Network Data," Chapter 11 in Models and methods in social network analysis, New York: Cambridge University Press, 2004.
|
 |
19
|
|
| |
20
|
Jeremy Kubica , Andrew Moore , Jeff Schneider , Yiming Yang, Stochastic link and group detection, Eighteenth national conference on Artificial intelligence, p.798-804, July 28-August 01, 2002, Edmonton, Alberta, Canada
|
| |
21
|
M. Handcock and D. Hunter, "Curved Exponential Family Models for Networks," XXV Intl. Social Network Conf., Feb. 2005.
|
| |
22
|
T. Hofmann, "Probabilistic Latent Semantic Analysis," Proc. of the Conf. on Uncertainty in Artificial Intelligence, 1999.
|
| |
23
|
|
| |
24
|
T. Griffiths and M. Steyvers, "Finding Scientific Topics," Proc. of the National Academy of Sciences, 5228--5235, 2004.
|
| |
25
|
Michal Rosen-Zvi , Thomas Griffiths , Mark Steyvers , Padhraic Smyth, The author-topic model for authors and documents, Proceedings of the 20th conference on Uncertainty in artificial intelligence, p.487-494, July 07-11, 2004, Banff, Canada
|
| |
26
|
A. McCallum, A. Corrada-Emmanuel, and X. Wang, "The Author-Recipient-Topic Model for Topic and Role Discovery in Social Networks: Experiments with Enron and Academic Email," Technical Report UM-CS-2004-096, 2004.
|
| |
27
|
X. Song, B. L. Tseng, C.-Y. Lin, and M.-T. Sun, "ExpertiseNet: Relational and Evolutionary Expert Modeling," 10th Intl. Conf. on User Modeling, Edinburgh, UK, July 24-30, 2005.
|
 |
28
|
|
| |
29
|
|
| |
30
|
J. Breese, D. Heckerman, and C. Kadie. "Empirical analysis of predictive algorithms for collaborative filtering," Conf. on Uncertainty in Artificial Intelligence, Madison,WI, July 1998.
|
CITED BY 8
|
|
|
|
|
Xiaodan Song , Belle L. Tseng , Ching-Yung Lin , Ming-Ting Sun, Personalized recommendation driven by information flow, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, August 06-11, 2006, Seattle, Washington, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Christopher P. Diehl , Galileo Namata , Lise Getoor, Relationship identification for social network discovery, Proceedings of the 22nd national conference on Artificial intelligence, p.546-552, July 22-26, 2007, Vancouver, British Columbia, Canada
|
|
|
|
|