ABSTRACT
Privacy and security concerns can prevent sharing of data, derailing data mining projects. Distributed knowledge discovery, if done correctly, can alleviate this problem. The key is to obtain valid results, while providing guarantees on the (non)disclosure of data. We present a method for k-means clustering when different sites contain different attributes for a common set of entities. Each site learns the cluster of each entity, but learns nothing about the attributes at other sites.
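As a rough illustration of the setting described above, the sketch below shows why k-means fits vertically partitioned data: the squared Euclidean distance from an entity to a cluster mean is the sum of per-site partial distances, each computable from that site's attributes alone. This is not the secure protocol the paper presents. The partial distances are combined in the clear here, and the function names, the two sites, and the data are all hypothetical.

```python
# Illustration only: NOT a privacy-preserving protocol.
# Each "site" holds different attribute columns for the same entities.

def local_sq_distances(rows, local_means):
    """One site's view: squared distance from each entity's local
    attributes to the local component of each cluster mean."""
    return [
        [sum((x - m) ** 2 for x, m in zip(row, mean)) for mean in local_means]
        for row in rows
    ]

def assign_clusters(per_site_distances):
    """Sum the per-site partial distances and pick the closest cluster.

    A secure protocol would have to perform this sum and comparison
    without revealing any site's partial distances; this sketch does
    it in the clear purely to show the arithmetic."""
    n_entities = len(per_site_distances[0])
    n_clusters = len(per_site_distances[0][0])
    assignments = []
    for i in range(n_entities):
        totals = [
            sum(site[i][k] for site in per_site_distances)
            for k in range(n_clusters)
        ]
        assignments.append(min(range(n_clusters), key=totals.__getitem__))
    return assignments

# Hypothetical data: three entities, two clusters, two sites.
site_a_rows = [[1.0], [9.0], [2.0]]                   # site A holds 1 attribute
site_b_rows = [[1.0, 2.0], [8.0, 9.0], [0.0, 1.0]]    # site B holds 2 attributes
site_a_means = [[1.5], [9.0]]                         # site A's share of each mean
site_b_means = [[0.5, 1.5], [8.0, 9.0]]               # site B's share of each mean

partials = [
    local_sq_distances(site_a_rows, site_a_means),
    local_sq_distances(site_b_rows, site_b_means),
]
clusters = assign_clusters(partials)
print(clusters)
```

Each site also only ever needs its own components of the cluster means, since a mean over the entities assigned to a cluster can be computed attribute by attribute.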