ACM Home Page
Please provide us with feedback. Feedback
Practical privacy: the SuLQ framework
Full text PdfPdf (237 KB)
Source Symposium on Principles of Database Systems archive
Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems table of contents
Baltimore, Maryland
SESSION: Research session 3: security and privacy table of contents
Pages: 128 - 138  
Year of Publication: 2005
ISBN:1-59593-062-0
Authors
Avrim Blum  Carnegie Mellon University, Pittsburgh, PA
Cynthia Dwork  Microsoft Research, SVC, La Avenida, Mountain View, CA
Frank McSherry  Microsoft Research, SVC, La Avenida, Mountain View, CA
Kobbi Nissim  Ben-Gurion University, Israel
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGMOD: ACM Special Interest Group on Management of Data
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 15,   Downloads (12 Months): 72,   Citation Count: 31
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1065167.1065184
What is a DOI?

ABSTRACT

We consider a statistical database in which a trusted administrator introduces noise to the query responses with the goal of maintaining privacy of individual database entries. In such a database, a query consists of a pair (S, f) where S is a set of rows in the database and f is a function mapping database rows to {0, 1}. The true answer is ΣiεS f(di), and a noisy version is released as the response to the query. Results of Dinur, Dwork, and Nissim show that a strong form of privacy can be maintained using a surprisingly small amount of noise -- much less than the sampling error -- provided the total number of queries is sublinear in the number of database rows. We call this query and (slightly) noisy reply the SuLQ (Sub-Linear Queries) primitive. The assumption of sublinearity becomes reasonable as databases grow increasingly large.We extend this work in two ways. First, we modify the privacy analysis to real-valued functions f and arbitrary row types, as a consequence greatly improving the bounds on noise required for privacy. Second, we examine the computational power of the SuLQ primitive. We show that it is very powerful indeed, in that slightly noisy versions of the following computations can be carried out with very few invocations of the primitive: principal component analysis, k means clustering, the Perceptron Algorithm, the ID3 algorithm, and (apparently!) all algorithms that operate in the in the statistical query learning model [11].


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
 
4
S. Chawla, C. Dwork, F. McSherry, A. Smith and H. Wee, Toward Privacy in Public Databases, TCC 2005.
5
6
 
7
C. Dwork and N. Nissim, Privacy-Preserving Datamining on Vertically Partitioned Databases, Proceedings of CRYPTO 2004, pp. 528--544, 2004.
8
 
9
10
11
 
12
 
13
 
14
 
15
M. J. O'Connel, Search Program for Significant Variables, Comp. Phys. Comm. 8, 1974.
 
16
 
17

CITED BY  31
Collaborative Colleagues:
Avrim Blum: colleagues
Cynthia Dwork: colleagues
Frank McSherry: colleagues
Kobbi Nissim: colleagues