ACM Home Page
Please provide us with feedback. Feedback
An integer programming approach for frequent itemset hiding
Full text PdfPdf (317 KB)
Source Conference on Information and Knowledge Management archive
Proceedings of the 15th ACM international conference on Information and knowledge management table of contents
Arlington, Virginia, USA
SESSION: Privacy, string search table of contents
Pages: 748 - 757  
Year of Publication: 2006
ISBN:1-59593-433-2
Authors
Aris Gkoulalas-Divanis  University of Thessaly, Volos, GREECE
Vassilios S. Verykios  University of Thessaly, Volos, GREECE
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 15,   Downloads (12 Months): 99,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1183614.1183721
What is a DOI?

ABSTRACT

The rapid growth of transactional data brought, soon enough, into attention the need of its further exploitation. In this paper, we investigate the problem of securing sensitive knowledge from being exposed in patterns extracted during association rule mining. Instead of hiding the produced rules directly, we decide to hide the sensitive frequent itemsets that may lead to the production of these rules. As a first step, we introduce the notion of distance between two databases and a measure for quantifying it. By trying to minimize the distance between the original database and its sanitized version (that can safely be released), we propose a novel, exact algorithm for association rule hiding and evaluate it on real world datasets demonstrating its effectiveness towards solving the problem.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
ILOG CPLEX 9.0 User's Manual. ILOG Inc., Gentilly, France, 2003.
 
2
 
3
4
 
5
C. Clifton, M. Kantarcioglu, and J. Vaidya. Defining privacy for data mining. WNGDM '02: National Science Foundation Workshop on Next Generation Data Mining, pages 126--133, 2002.
 
6
C. Clifton and D. Marks. Security and privacy implications of data mining. SIGMOD '96: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pages 15--20, 1996.
7
 
8
 
9
C. Gueret, C. Prins, and M. Sevaux. Applications of Optimization with Xpress-MP. Dash Optimization Ltd., 2002.
10
 
11
D. Luenberger. Introduction to Linear and Non-linear Programming. Addison-Wesley Publishing Company, 1973.
 
12
S. Menon, S. Sarkar, and S. Mukherjee. Maximizing accuracy of shared databases when concealing sensitive patterns. Information Systems Research, 16(3):256--270, 2005.
 
13
M. Morgenstern. Controlling logical inference in multilevel database and knowledge-base systems. IEEE '88: Proceedings of the 1988 IEEE Symposium on Security and Privacy, pages 245--255, 1988.
 
14
15
 
16
 
17
Y. Saad. SPARSKIT: A basic toolkit for sparse matrix computations (version 2). Technical report, University of Minnesota, Minneapolis, MN, 1994. http://www-users.cs.umn.edu/~saad/software/SPARSKIT/sparskit.html.
 
18
Y. Saygin, V. Verykios, and C. Clifton. Using unknowns to prevent discovery of association rules. SIGMOD '01: Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, pages 45--54, 2001.
 
19


Collaborative Colleagues:
Aris Gkoulalas-Divanis: colleagues
Vassilios S. Verykios: colleagues