ACM Home Page
Please provide us with feedback. Feedback
Anonymizing bipartite graph data using safe groupings
Full text PdfPdf (517 KB)
Source
Proceedings of the VLDB Endowment archive
Volume 1 ,  Issue 1  (August 2008) table of contents
SESSION: Privacy preservation table of contents
Pages 833-844  
Year of Publication: 2008
ISSN:2150-8097
Authors
Graham Cormode  AT&T Labs-Research, Florham Park, NJ
Divesh Srivastava  AT&T Labs-Research, Florham Park, NJ
Ting Yu  North Carolina State University, Raleigh, NC
Qing Zhang  North Carolina State University, Raleigh, NC
Publisher
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 93,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1453856.1453947
What is a DOI?

ABSTRACT

Private data often comes in the form of associations between entities, such as customers and products bought from a pharmacy, which are naturally represented in the form of a large, sparse bipartite graph. As with tabular data, it is desirable to be able to publish anonymized versions of such data, to allow others to perform ad hoc analysis of aggregate graph properties. However, existing tabular anonymization techniques do not give useful or meaningful results when applied to graphs: small changes or masking of the edge structure can radically change aggregate graph properties.

We introduce a new family of anonymizations, for bipartite graph data, called (k, l)-groupings. These groupings preserve the underlying graph structure perfectly, and instead anonymize the mapping from entities to nodes of the graph. We identify a class of "safe" (k, l)-groupings that have provable guarantees to resist a variety of attacks, and show how to find such safe groupings. We perform experiments on real bipartite graph data to study the utility of the anonymized version, and the impact of publishing alternate groupings of the same graph data. Our experiments demonstrate that (k, l)-groupings offer strong tradeoffs between privacy and utility.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
J. Bennett and S. Lanning. The Netflix prize. In KDDCup, 2007.
 
3
 
4
G. Ghinita, Y. Tao, and P. Kalnis. On the anonymization of sparse high-dimensional data. In ICDE, 2008.
 
5
M. Hay, G. Miklau, D. Jensen, P. Weis, and S. Srivastava. Anonymizing social networks. Tech Report 07--19, U. Mass Amherst, 2007.
6
 
7
N. Li, T. Li, and S. Venkatasubramanian. t-closeness: Privacy beyond k-anonymity and l-diversity. In ICDE, 2007.
 
8
 
9
D. J. Martin, D. Kifer, A. Machanavajjhala, and J. Gehrke. Worse-case background knowledge for privacy-preserving data publishing. In ICDE, 2007.
 
10
M. E. Nergiz, C. Clifton, and A. E. Nergiz. Multirelational k-anonymity. In ICDE, 2007.
 
11
 
12
13
 
14
 
15
 
16
Q. Zhang, N. Koudas, D. Srivastava, and T. Yu. Aggregate query answering on anonymized tables. In ICDE, 2007.
 
17
E. Zheleva and L. Getoor. Preserving the privacy of sensitive relationships in graph data. In (PinKDD), 2007.
 
18
B. Zhou and J. Pei. Preserving privacy in social networks against neighborhood attacks. In ICDE, 2008.


Collaborative Colleagues:
Graham Cormode: colleagues
Divesh Srivastava: colleagues
Ting Yu: colleagues
Qing Zhang: colleagues