ACM Home Page
Please provide us with feedback. Feedback
On the tradeoff between privacy and utility in data publishing
Full text MovMov (15:53),  PdfPdf (454 KB)
Source
International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Paris, France
SESSION: Research track papers table of contents
Pages 517-526  
Year of Publication: 2009
ISBN:978-1-60558-495-9
Authors
Tiancheng Li  Purdue University, West Lafayette, IN, USA
Ninghui Li  Purdue University, West Lafayette, IN, USA
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1557019.1557079
What is a DOI?

ABSTRACT

In data publishing, anonymization techniques such as generalization and bucketization have been designed to provide privacy protection. In the meanwhile, they reduce the utility of the data. It is important to consider the tradeoff between privacy and utility. In a paper that appeared in KDD 2008, Brickell and Shmatikov proposed an evaluation methodology by comparing privacy gain with utility gain resulted from anonymizing the data, and concluded that "even modest privacy gains require almost complete destruction of the data-mining utility". This conclusion seems to undermine existing work on data anonymization. In this paper, we analyze the fundamental characteristics of privacy and utility, and show that it is inappropriate to directly compare privacy with utility. We then observe that the privacy-utility tradeoff in data publishing is similar to the risk-return tradeoff in financial investment, and propose an integrated framework for considering privacy-utility tradeoff, borrowing concepts from the Modern Portfolio Theory for financial investment. Finally, we evaluate our methodology on the Adult dataset from the UCI machine learning repository. Our results clarify several common misconceptions about data utility and provide data publishers useful guidelines on choosing the right tradeoff between privacy and utility.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
A. Asuncion and D. Newman. UCI machine learning repository, 2007.
 
3
M. Barbaro and T. Z. Jr. A face is exposed for aol searcher no. 4417749. New York Times, 2006.
 
4
5
 
6
G. T. Duncan and D. Lambert. Disclosure-limited data dissemination. J. Am. Stat. Assoc., pages 10--28, 1986.
 
7
C. Dwork. Differential privacy. In ICALP, pages 1--12, 2006.
 
8
E. Elton and M. Gruber. Modern Portfolio Theory and Investment Analysis. John Wiley&Sons Inc, 1995.
 
9
B. C. M. Fung, K. Wang, R. Chen, and P. S. Yu. Privacy-preserving data publishing: A survey on recent developments. ACM Computing Survey, 2009.
 
10
11
12
13
 
14
N. Koudas, D. Srivastava, T. Yu, and Q. Zhang. Aggregate query answering on anonymized tables. In ICDE, pages 116--125, 2007.
 
15
S. L. Kullback and R. A. Leibler. On information and sufficiency. Ann. Math. Stat., 22:79--86, 1951.
 
16
D. Lambert. Measures of disclosure risk and harm. J. Official Stat., 9:313--331, 1993.
 
17
18
 
19
N. Li, T. Li, and S. Venkatasubramanian. t-closeness: Privacy beyond k-anonymity and l-diversity. In ICDE, pages 106--115, 2007.
 
20
 
21
 
22
D. J. Martin, D. Kifer, A. Machanavajjhala, J. Gehrke, and J. Y. Halpern. Worst-case background knowledge for privacy-preserving data publishing. In ICDE, pages 126--135, 2007.
23
 
24
 
25
 
26
 
27
 
28
29
 
30
 
31

Collaborative Colleagues:
Tiancheng Li: colleagues
Ninghui Li: colleagues