|
ABSTRACT
Suppose there are many clients, each having some personal information, and one server, which is interested only in aggregate, statistically significant, properties of this information. The clients can protect privacy of their data by perturbing it with a randomization algorithm and then submitting the randomized version. The randomization algorithm is chosen so that aggregate properties of the data can be recovered with sufficient precision, while individual entries are significantly distorted. How much distortion is needed to protect privacy can be determined using a privacy measure. Several possible privacy measures are known; finding the best measure is an open question. This paper presents some methods and results in randomization for numerical and categorical data, and discusses the issue of measuring privacy.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
Rakesh Agrawal , Tomasz Imieliński , Arun Swami, Mining association rules between sets of items in large databases, Proceedings of the 1993 ACM SIGMOD international conference on Management of data, p.207-216, May 25-28, 1993, Washington, D.C., United States
|
| |
3
|
|
 |
4
|
|
| |
5
|
L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone. Classification and Regression Trees. CRC Press, Boca Raton, Florida, USA, 1984.
|
 |
6
|
|
| |
7
|
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal statistical Society, Series B, 39:1--38, 1977.
|
| |
8
|
|
| |
9
|
G. T. Duncan and S. Mukherjee. Optimal disclosure limitation strategy in statistical databases: Deterring tracker attacks through additive noise. Journal of the American Statistical Association, 95(451):720--729, 2000.
|
| |
10
|
T. Evans, L. Zayatz, and J. Slanta. Using noise for disclosure limitation of establishment tabular data. Journal of Official Statistics, 14(4):537--551, 1998.
|
 |
11
|
Alexandre Evfimievski , Ramakrishnan Srikant , Rakesh Agrawal , Johannes Gehrke, Privacy preserving mining of association rules, Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, July 23-26, 2002, Edmonton, Alberta, Canada
[doi> 10.1145/775047.775080]
|
| |
12
|
S. E. Fienberg, U. E. Makov, and R. J. Steele. Disclosure limitation using perturbation and related methods for categorical data. Journal of Official Statistics, 14(4):485--502, 1998.
|
| |
13
|
J. M. Gouweleeuw, P. Kooiman, L. C. R. J. Willenborg, and P.-P. de Wolf. Post randomisation for statistical disclosure control: Theory and implementation. Journal of Official Statistics, 14(4):463--478, 1998.
|
 |
14
|
|
| |
15
|
J. J. Kim and W. E. Winkler. Masking microdata files, 1995.
|
| |
16
|
|
| |
17
|
D. Lambert. Measures of disclosure risk and harm. Journal of Official Statistics, 9(2):313--331, 1993.
|
| |
18
|
R. J. A. Little. Statistical analysis of masked data. Journal of Official Statistics, 9(2):407--426, 1993.
|
| |
19
|
G. J. McLachlan and T. Krishnan. The EM Algorithm and Extensions. Wiley-Interscience, November 1996. 304 pp.
|
| |
20
|
|
 |
21
|
|
| |
22
|
S. J. Rizvi and J. R. Haritsa. Maintaining data privacy in association rule mining. In Proceedings of the 28th International Conference on Very Large Data Bases, Hong Kong, China, August 2002.
|
 |
23
|
|
| |
24
|
P. Samarati and L. Sweeney. Protecting privacy when disclosing information: k-anonymity and its enforcement through generalization and suppression. In Proceedings of the IEEE Symposium on Research in Security and Privacy, Oakland, California, USA, May 1998.
|
| |
25
|
C. E. Shannon. Communication theory of secrecy systems. Bell System Technical Journal, 28--4:656--715, 1949.
|
| |
26
|
|
| |
27
|
L. Zayatz, R. Moore, and B. T. Evans. New directions in disclosure limitation at the census bureau. In Proceedings of the Government Statistics Section, Joint Statistical Meetings. American Statistical Association, 1996.
|
CITED BY 15
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Raghu K. Ganti , Nam Pham , Yu-En Tsai , Tarek F. Abdelzaher, PoolView: stream privacy for grassroots participatory sensing, Proceedings of the 6th ACM conference on Embedded network sensor systems, November 05-07, 2008, Raleigh, NC, USA
|
|
|
|
|
|
|
|
|
|
|
|
Tarek Abdelzaher , Yaw Anokwa , Peter Boda , Jeff Burke , Deborah Estrin , Leonidas Guibas , Aman Kansal , Samuel Madden , Jim Reich, Mobiscopes for Human Spaces, IEEE Pervasive Computing, v.6 n.2, p.20-29, April 2007
|
|