ACM Home Page
Please provide us with feedback. Feedback
On anonymizing query logs via token-based hashing
Full text PdfPdf (328 KB)
Source
International World Wide Web Conference archive
Proceedings of the 16th international conference on World Wide Web table of contents
Banff, Alberta, Canada
SESSION: Defending against emerging threats table of contents
Pages: 629 - 638  
Year of Publication: 2007
ISBN:978-1-59593-654-7
Authors
Ravi Kumar  Yahoo! Research
Jasmine Novak  Yahoo! Research
Bo Pang  Yahoo! Research
Andrew Tomkins  Yahoo! Research
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 72,   Citation Count: 6
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1242572.1242657
What is a DOI?

ABSTRACT

In this paper we study the privacy preservation properties of aspecific technique for query log anonymization: token-based hashing. In this approach, each query is tokenized, and then a secure hash function is applied to each token. We show that statistical techniques may be applied to partially compromise the anonymization. We then analyze the specific risks that arise from these partial compromises, focused on revelation of identity from unambiguous names, addresses, and so forth, and the revelation of facts associated with an identity that are deemed to be highly sensitive. Our goal in this work is two fold: to show that token-based hashing is unsuitable for anonymization, and to present a concrete analysis of specific techniques that may be effective in breaching privacy, against which other anonymization schemes should be measured.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
3
4
 
5
6
 
7
 
8
9
10
11
12
 
13
14
 
15
16
17
 
18
A. Slagell and W. Yurcik. Sharing computer network logs for security and privacy: A motivation for new methodologies of anonymization. In Workshop of the 1st International Conference on Security and Privacy for Emerging Areas in Communication Networks, pages 80--89, 2005.
 
19
 
20
 
21
22


Collaborative Colleagues:
Ravi Kumar: colleagues
Jasmine Novak: colleagues
Bo Pang: colleagues
Andrew Tomkins: colleagues