|
ABSTRACT
In order to address privacy concerns, many social media websites allow users to hide their personal profiles from the public. In this work, we show how an adversary can exploit an online social network with a mixture of public and private user profiles to predict the private attributes of users. We map this problem to a relational classification problem and we propose practical models that use friendship and group membership information (which is often not hidden) to infer sensitive attributes. The key novel idea is that in addition to friendship links, groups can be carriers of significant information. We show that on several well-known social media sites, we can easily and accurately recover the information of private-profile users. To the best of our knowledge, this is the first work that uses link-based and group-based classification to study privacy implications in social networks with mixed public and private user profiles.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
G. Aggarwal, T. Feder, K. Kenthapadi, R. Motwani, R. Panigrahy, D. Thomas, and A. Zhu. Approximation algorithms for k--anonimity. JPT, Nov. 2005.
|
| |
2
|
|
 |
3
|
Lars Backstrom , Cynthia Dwork , Jon Kleinberg, Wherefore art thou r3579x?: anonymized social networks, hidden patterns, and structural steganography, Proceedings of the 16th international conference on World Wide Web, May 08-12, 2007, Banff, Alberta, Canada
[doi> 10.1145/1242572.1242598]
|
| |
4
|
D. Baldassarri and A. Gelman. Partisans without constraint: Political polarization and trends in american public opinion. American Journal of Sociology, 114(2):408--446, September 2008.
|
| |
5
|
|
| |
6
|
C. Dwork. Differential privacy. In ICALP, 2006. L. Getoor and B. Taskar, editors. Introduction to statistical relational learning. MIT Press, 2007.
|
| |
7
|
|
| |
8
|
J. He, W. Chu, and Z. Liu. Inferring privacy information from social networks. In ISI, 2006.
|
| |
9
|
K. Lewis, J. Kaufman, M. Gonzalez, A. Wimmer, and N. Christakis. Tastes, ties, and time. hd l:1902.1/11827.
|
| |
10
|
N. Li, T. Li, and S. Venkatasubramanian. t-closeness: Privacy beyond k-anon. and l-diversity. In ICDE, 2007.
|
| |
11
|
D. Liben-Nowell, J. Novak, R. Kumar, P. Raghavan, and A. Tomkins. Geographic routing in social networks. PNAS, 102(33):11623--11628, August 2005.
|
 |
12
|
|
| |
13
|
|
 |
14
|
|
| |
15
|
A. Machanava jjhala, J. Gehrke, D. Kifer, and M. Venkitasubramaniam. l-diversity: Privacy beyond k-anonymity. In ICDE, 2006.
|
| |
16
|
|
| |
17
|
|
| |
18
|
M. E. Nergiz and C. Clifton. Multirelational k-anonymity. In ICDE, April 2007.
|
| |
19
|
|
| |
20
|
P. Sen, G. M. Namata, M. Bilgic, L. Getoor, B. Gallagher, and T. Eliassi--Rad. Collective classification in network data. Technical Report CS-TR-4905, Univ. of Maryland, 2008.
|
| |
21
|
|
 |
22
|
Ioannis Tsochantaridis , Thomas Hofmann , Thorsten Joachims , Yasemin Altun, Support vector machine learning for interdependent and structured output spaces, Proceedings of the twenty-first international conference on Machine learning, p.104, July 04-08, 2004, Banff, Alberta, Canada
[doi> 10.1145/1015330.1015341]
|
| |
23
|
Y. Wang and G. Wong. Stochastic blockmodels for directed graphs. JASA, 1987.
|
| |
24
|
E. Zheleva and L. Getoor. Preserving the privacy of sensitive relationships in graph data. PinKDD, 2007.
|
|