| An integer programming approach for frequent itemset hiding |
| Full text |
Pdf
(317 KB)
|
| Source
|
Conference on Information and Knowledge Management
archive
Proceedings of the 15th ACM international conference on Information and knowledge management
table of contents
Arlington, Virginia, USA
SESSION: Privacy, string search
table of contents
Pages: 748 - 757
Year of Publication: 2006
ISBN:1-59593-433-2
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 15, Downloads (12 Months): 99, Citation Count: 2
|
|
|
ABSTRACT
The rapid growth of transactional data brought, soon enough, into attention the need of its further exploitation. In this paper, we investigate the problem of securing sensitive knowledge from being exposed in patterns extracted during association rule mining. Instead of hiding the produced rules directly, we decide to hide the sensitive frequent itemsets that may lead to the production of these rules. As a first step, we introduce the notion of distance between two databases and a measure for quantifying it. By trying to minimize the distance between the original database and its sanitized version (that can safely be released), we propose a novel, exact algorithm for association rule hiding and evaluate it on real world datasets demonstrating its effectiveness towards solving the problem.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
ILOG CPLEX 9.0 User's Manual. ILOG Inc., Gentilly, France, 2003.
|
| |
2
|
|
| |
3
|
|
 |
4
|
|
| |
5
|
C. Clifton, M. Kantarcioglu, and J. Vaidya. Defining privacy for data mining. WNGDM '02: National Science Foundation Workshop on Next Generation Data Mining, pages 126--133, 2002.
|
| |
6
|
C. Clifton and D. Marks. Security and privacy implications of data mining. SIGMOD '96: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pages 15--20, 1996.
|
 |
7
|
|
| |
8
|
|
| |
9
|
C. Gueret, C. Prins, and M. Sevaux. Applications of Optimization with Xpress-MP. Dash Optimization Ltd., 2002.
|
 |
10
|
|
| |
11
|
D. Luenberger. Introduction to Linear and Non-linear Programming. Addison-Wesley Publishing Company, 1973.
|
| |
12
|
S. Menon, S. Sarkar, and S. Mukherjee. Maximizing accuracy of shared databases when concealing sensitive patterns. Information Systems Research, 16(3):256--270, 2005.
|
| |
13
|
M. Morgenstern. Controlling logical inference in multilevel database and knowledge-base systems. IEEE '88: Proceedings of the 1988 IEEE Symposium on Security and Privacy, pages 245--255, 1988.
|
| |
14
|
|
 |
15
|
Emmanuel D. Pontikakis , Yannis Theodoridis , Achilleas A. Tsitsonis , Liwu Chang , Vassilios S. Verykios, A quantitative and qualitative ANALYSIS of blocking in association rule hiding, Proceedings of the 2004 ACM workshop on Privacy in the electronic society, October 28-28, 2004, Washington DC, USA
[doi> 10.1145/1029179.1029187]
|
| |
16
|
|
| |
17
|
Y. Saad. SPARSKIT: A basic toolkit for sparse matrix computations (version 2). Technical report, University of Minnesota, Minneapolis, MN, 1994. http://www-users.cs.umn.edu/~saad/software/SPARSKIT/sparskit.html.
|
| |
18
|
Y. Saygin, V. Verykios, and C. Clifton. Using unknowns to prevent discovery of association rules. SIGMOD '01: Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, pages 45--54, 2001.
|
| |
19
|
|
CITED BY 2
|
|
Arjun Dasgupta , Nan Zhang , Gautam Das , Surajit Chaudhuri, Privacy preservation of aggregates in hidden databases: why and how?, Proceedings of the 35th SIGMOD international conference on Management of data, June 29-July 02, 2009, Providence, Rhode Island, USA
|
|
|
|
|