ACM Home Page
Please provide us with feedback. Feedback
Distributed privacy preserving k-means clustering with additive secret sharing
Full text PdfPdf (227 KB)
Source ACM International Conference Proceeding Series; Vol. 331 archive
Proceedings of the 2008 international workshop on Privacy and anonymity in information society table of contents
Nantes, France
SESSION: Distributed privacy protection and query auditing table of contents
Pages 3-11  
Year of Publication: 2008
ISBN:978-1-59593-965-4
Authors
Mahir Can Doganay  Sabanci University, Istanbul, Turkey
Thomas B. Pedersen  Sabanci University, Istanbul, Turkey
Yücel Saygin  Sabanci University, Istanbul, Turkey
Erkay Savaş  Sabanci University, Istanbul, Turkey
Albert Levi  Sabanci University, Istanbul, Turkey
Sponsor
: UNESCO
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 126,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1379287.1379291
What is a DOI?

ABSTRACT

Recent concerns about privacy issues motivated data mining researchers to develop methods for performing data mining while preserving the privacy of individuals. However, the current techniques for privacy preserving data mining suffer from high communication and computation overheads which are prohibitive considering even a modest database size. Furthermore, the proposed techniques have strict assumptions on the involved parties which need to be relaxed in order to reflect the real-world requirements. In this paper we concentrate on a distributed scenario where the data is partitioned vertically over multiple sites and the involved sites would like to perform clustering without revealing their local databases. For this setting, we propose a new protocol for privacy preserving k-means clustering based on additive secret sharing. We show that the new protocol is more secure than the state of the art. Experiments conducted on real and synthetic data sets show that, in realistic scenarios, the communication and computation cost of our protocol is considerably less than the state of the art which is crucial for data mining applications.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
 
4
 
5
 
6
S. V. Kaya, T. B. Pedersen, E. Savaş, and Y. Saygin. Efficient privacy preserving distributed clustering based on secret sharing. In PAKDD 2007 International Workshops: Emerging Technologies in Knowledge Discovery and Data Mining, pages 280--291. Springer, 2007.
 
7
Selim Volkan Kaya. Toolbox for Privacy Preserving Data Mining. Master's thesis, Sabanci University, Istanbul, TURKEY, July 2007.
8
 
9
 
10
Pascal Paillier. Public-key cryptosystems based on composite degree residuosity classes. In Advances in Cryptology --- EUROCRYPT '99. International Conference on the Theory and Application of Cryptographic Techniques, Lecture Notes in Computer Science, pages 223--238. Springer-Verlag, May 1999.
 
11
12
 
13
Jaikumar Vijayan. House committee chair wants info on cancelled dhs data-mining programs. Computer World, September 18 2007.
14
 
15


Collaborative Colleagues:
Mahir Can Doganay: colleagues
Thomas B. Pedersen: colleagues
Yücel Saygin: colleagues
Erkay Savaş: colleagues
Albert Levi: colleagues