ACM Home Page
Please provide us with feedback. Feedback
Randomization in privacy preserving data mining
Full text PdfPdf (153 KB)
Source ACM SIGKDD Explorations Newsletter archive
Volume 4 ,  Issue 2  (December 2002) table of contents
Pages: 43 - 48  
Year of Publication: 2002
ISSN:1931-0145
Author
Alexandre Evfimievski  Cornell University, Ithaca, NY
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 69,   Citation Count: 15
Additional Information:

abstract   references   cited by   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/772862.772869
What is a DOI?

ABSTRACT

Suppose there are many clients, each having some personal information, and one server, which is interested only in aggregate, statistically significant, properties of this information. The clients can protect privacy of their data by perturbing it with a randomization algorithm and then submitting the randomized version. The randomization algorithm is chosen so that aggregate properties of the data can be recovered with sufficient precision, while individual entries are significantly distorted. How much distortion is needed to protect privacy can be determined using a privacy measure. Several possible privacy measures are known; finding the best measure is an open question. This paper presents some methods and results in randomization for numerical and categorical data, and discusses the issue of measuring privacy.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
4
 
5
L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone. Classification and Regression Trees. CRC Press, Boca Raton, Florida, USA, 1984.
6
 
7
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal statistical Society, Series B, 39:1--38, 1977.
 
8
 
9
G. T. Duncan and S. Mukherjee. Optimal disclosure limitation strategy in statistical databases: Deterring tracker attacks through additive noise. Journal of the American Statistical Association, 95(451):720--729, 2000.
 
10
T. Evans, L. Zayatz, and J. Slanta. Using noise for disclosure limitation of establishment tabular data. Journal of Official Statistics, 14(4):537--551, 1998.
11
 
12
S. E. Fienberg, U. E. Makov, and R. J. Steele. Disclosure limitation using perturbation and related methods for categorical data. Journal of Official Statistics, 14(4):485--502, 1998.
 
13
J. M. Gouweleeuw, P. Kooiman, L. C. R. J. Willenborg, and P.-P. de Wolf. Post randomisation for statistical disclosure control: Theory and implementation. Journal of Official Statistics, 14(4):463--478, 1998.
14
 
15
J. J. Kim and W. E. Winkler. Masking microdata files, 1995.
 
16
 
17
D. Lambert. Measures of disclosure risk and harm. Journal of Official Statistics, 9(2):313--331, 1993.
 
18
R. J. A. Little. Statistical analysis of masked data. Journal of Official Statistics, 9(2):407--426, 1993.
 
19
G. J. McLachlan and T. Krishnan. The EM Algorithm and Extensions. Wiley-Interscience, November 1996. 304 pp.
 
20
21
 
22
S. J. Rizvi and J. R. Haritsa. Maintaining data privacy in association rule mining. In Proceedings of the 28th International Conference on Very Large Data Bases, Hong Kong, China, August 2002.
23
 
24
P. Samarati and L. Sweeney. Protecting privacy when disclosing information: k-anonymity and its enforcement through generalization and suppression. In Proceedings of the IEEE Symposium on Research in Security and Privacy, Oakland, California, USA, May 1998.
 
25
C. E. Shannon. Communication theory of secrecy systems. Bell System Technical Journal, 28--4:656--715, 1949.
 
26
 
27
L. Zayatz, R. Moore, and B. T. Evans. New directions in disclosure limitation at the census bureau. In Proceedings of the Government Statistics Section, Joint Statistical Meetings. American Statistical Association, 1996.

CITED BY  15
 
 
 
 
 
 
 
 
 
 
 
Collaborative Colleagues:
Alexandre Evfimievski: colleagues

Peer to Peer - Readers of this Article have also read: