| The union-split algorithm and cluster-based anonymization of social networks |
| Full text |
Pdf
(842 KB)
|
Source
|
ASIAN ACM Symposium on Information, Computer and Communications Security
archive
Proceedings of the 4th International Symposium on Information, Computer, and Communications Security
table of contents
Sydney, Australia
SESSION: Anonymity and privacy
table of contents
Pages 218-227
Year of Publication: 2009
ISBN:978-1-60558-394-5
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 10, Downloads (12 Months): 92, Citation Count: 0
|
|
|
ABSTRACT
Knowledge discovery on social network data can uncover latent social trends and produce valuable findings that benefit the welfare of the general public. A growing amount of research finds that social networks play a surprisingly powerful role in people's behaviors. Before the social network data can be released for research purposes, the data needs to be anonymized to prevent potential re-identification attacks. Most of the existing anonymization approaches were developed for relational data, and cannot be used to handle social network data directly. In this paper, we model social networks as undirected graphs and formally define privacy models, attack models for the anonymization problem, in particular an i-hop degree-based anonymization problem, i.e., the adversary's prior knowledge includes the target's degree and the degrees of neighbors within i hops from the target. We present two new and efficient clustering methods for undirected graphs: bounded t-means clustering and union-split clustering algorithms that group similar graph nodes into clusters with a minimum size constraint. These clustering algorithms are contributions beyond the specific social network problems studied and can be used to cluster general data types besides graph vertices. We also develop a simple-yet-effective inter-cluster matching method for anonymizing social networks by strategically adding and removing edges based on nodes' social roles. We carry out a series of experiments to evaluate the graph utilities of the anonymized social networks produced by our algorithms.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
P. Bradley, K. Bennett, and A. Demiriz. Constrained k-means clustering. Technical Report MSR-TR-2000-65, Microsoft Research, 2000.
|
| |
2
|
J.-W. Byun, A. Kamra, E. Bertino, and N. Li. Efficient k-anonymization using clustering techniques. In Proceedings of the 12th International Conference on Database Systems for Advanced Applications (DASFAA), volume 4443 of Lecture Notes in Computer Science, pages 188--200. Springer, 2007.
|
| |
3
|
Cambridge English Dictionary. http://dictionary.cambridge.org/define.asp? key=68410&dict=CALD.
|
| |
4
|
D. Chakrabarti, Y. Zhan, and C. Faloutsos. R-MAT: A recursive model for graph mining. In M. W. Berry, U. Dayal, C. Kamath, and D. B. Skillicorn, editors, SDM. SIAM, 2004.
|
| |
5
|
G. Cormode, D. Srivastava, T. Yu, and Q. Zhang. Anonymizing bipartite graph data using safe groupings. In Proceedings of the International Conference on Very Large Data Bases (VLDB), 2008.
|
| |
6
|
|
| |
7
|
N. Li, T. Li, and S. Venkatasubramanian. t-closeness: Privacy beyond k-anonymity and l-diversity. In Proceedings of the 23rd International Conference on Data Engineering (ICDE), pages 106--115, 2007.
|
 |
8
|
|
| |
9
|
|
| |
10
|
S. Milgram. The small world problem. Psychology Today, 1(1):60--67, May 1967.
|
| |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
M. E. Nergiz, C. Clifton, and A. E. Nergiz. Multirelational k-anonymity. In Proceedings of the 23rd International Conference on Data Engineering (ICDE), pages 1417--1421, 2007.
|
| |
15
|
Netflix Prize. http://www.netflixprize.com.
|
 |
16
|
|
| |
17
|
R. Stein. Social networks' sway may be underestimated. Washington Post, May 26, 2008.
|
| |
18
|
|
 |
19
|
Raymond Chi-Wing Wong , Jiuyong Li , Ada Wai-Chee Fu , Ke Wang, (α, k)-anonymity: an enhanced k-anonymity model for privacy preserving data publishing, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, August 20-23, 2006, Philadelphia, PA, USA
[doi> 10.1145/1150402.1150499]
|
| |
20
|
V. N. Zemlyachenko, N. M. Korneenko, and R. I. Tyshkevich. Graph isomorphism problem. Journal of Mathematical Sciences, 29(4):1426Ü1481, May 1985.
|
| |
21
|
B. Zhou and J. Pei. Preserving privacy in social networks against neighborhood attacks. In Proceedings of the 24th International Conference on Data Engineering (ICDE), pages 506--515, 2008.
|
|