ACM Home Page
Please provide us with feedback. Feedback
The risk-utility tradeoff for IP address truncation
Full text PdfPdf (225 KB)
Source
Conference on Computer and Communications Security archive
Proceedings of the 1st ACM workshop on Network data anonymization table of contents
Alexandria, Virginia, USA
SESSION: Anonymization techniques and metrics table of contents
Pages 23-30  
Year of Publication: 2008
ISBN:978-1-60558-301-3
Authors
Martin Burkhart  ETH, Zurich, Switzerland
Daniela Brauckhoff  ETH, Zurich, Switzerland
Martin May  ETH, Zurich, Switzerland
Elisa Boschi  Hitachi Europe, Zurich, Switzerland
Sponsors
ACM: Association for Computing Machinery
SIGSAC: ACM Special Interest Group on Security, Audit, and Control
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 90,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1456441.1456452
What is a DOI?

ABSTRACT

Network operators are reluctant to share traffic data due to security and privacy concerns. Consequently, there is a lack of publicly available traces for validating and generalizing the latest results in network and security research. Anonymization is a possible solution in this context; however, it is unclear how the sanitization of data preserves characteristics important for traffic analysis. In addition, the privacy-preserving property of state-of-the-art IP address anonymization techniques has come into question by recent attacks that successfully identified a large number of hosts in anonymized traces. In this paper, we examine the tradeoff between data utility for anomaly detection and the risk of host identification for IP address truncation. Specifically, we analyze three weeks of unsampled and non-anonymized network traces from a medium-sized backbone network to assess data utility. The risk of de-anonymizing individual IP addresses is formally evaluated, using a metric based on conditional entropy. Our results indicate that truncation effectively prevents host identification but degrades the utility of data for anomaly detection. However, the degree of degradation depends on the metric used and whether network-internal or external addresses are considered. Entropy metrics are more resistant to truncation than unique counts and the detection quality of anomalies degrades much faster in internal addresses than in external addresses. In particular, the usefulness of internal address counts is lost even for truncation of only 4 bits whereas utility of external address entropy is virtually unchanged even for truncation of 20 bits.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
M. Bezzi. An entropy-based method for measuring anonymity. In IEEE/CreateNet SECOVAL Workshop on the Value of Security through Collaboration, September 2007.
 
2
A. Bradley. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition, 30:1145--1159, 1997.
3
 
4
T. Brekne, A. °Arnes, and A. Øslebø. Anonymization of IP traffic data: Attacks on two prefix-preserving anonymization schemes and some proposed remedies. In Workshop on Privacy Enhancing Technologies, pages 179--196, 2005.
 
5
S. Coull, C. Wright, A. Keromytis, F. Monrose, and M. Reiter. Taming the devil: Techniques for evaluating anonymized network data. In 15th Annual Network and Distributed System Security Symposium (NDSS 08), February 2008.
 
6
S. Coull, C. Wright, F. Monrose, M. Collins, and M.K.Reiter. Playing devil's advocate: Inferring sensitive information from anonymized network traces. In 14th Annual Network and Distributed System Security Symposium, February 2007.
 
7
G. T. Duncan, S. A. Keller-McNulty, and S. L. Stokes. Disclosure risk vs. data utility: The r-u confidentiality map. Technical Report 121, National Institute of Statistical Sciences, December 2001.
 
8
EU. Directive 95/46/ec of the European parliament and of the council. OJ L 281, 23.11.1995, p. 31, October 1995.
 
9
EU. Directive 2002/58/ec of the European parliament and of the council. OJ L 201, 31.07.2002, p. 37, July 2002.
 
10
 
11
D. Koukis, S. Antonatos, and K. G. Anagnostakis. On the privacy risks of publishing anonymized IP network traces. In Communications and Multimedia Security, volume 4237 of Lecture Notes in Computer Science, pages 22--32. Springer, 2006.
 
12
A. Kounine and M. Bezzi. Assessing disclosure risk in anonymized datasets. In FloCon 2008, January 2008.
13
 
14
J. Mai, A. Sridharan, C.-N. Chuah, H. Zang, and T. Ye. Impact of packet sampling on portscan detection. Selected Areas in Communications, IEEE Journal on, 24(12):2285--2298, Dec. 2006.
 
15
G. Minshall. Tcpdpriv. http://ita.ee.lbl.gov/html/contrib/tcpdpriv.html.
16
17
 
18
B. Ribeiro, W. Chen, G. Miklau, and D. Towsley. Analyzing privacy in enterprise packet trace anonymization. In 15th Annual Network and Distributed System Security Symposium (NDSS 08), February 2008.
19
 
20
 
21
A. Soule, H. Larsen, F. Silveira, J. Rexford, and C. Diot. Detectability of traffic anomalies in two adjacent networks. In Passive And Active Measurement Conference (PAM), 2007.
 
22
 
23
 
24
SWITCH. The swiss education and research network. http://www.switch.ch.
 
25
 
26
 
27
J. Zhang, N. Borisov, and W. Yurcik. Outsourcing security analysis with anonymized logs. In Securecomm and Workshops, 2006, pages 1--9, Aug. 28 2006-Sept. 1 2006.

Collaborative Colleagues:
Martin Burkhart: colleagues
Daniela Brauckhoff: colleagues
Martin May: colleagues
Elisa Boschi: colleagues