ACM Home Page
Please provide us with feedback. Feedback
Reversible sketches for efficient and accurate change detection over network data streams
Full text PdfPdf (161 KB)
Source Internet Measurement Conference archive
Proceedings of the 4th ACM SIGCOMM conference on Internet measurement table of contents
Taormina, Sicily, Italy
SESSION: Detection table of contents
Pages: 207 - 212  
Year of Publication: 2004
ISBN:1-58113-821-0
Authors
Robert Schweller  Northwestern University, Evanston, IL
Ashish Gupta  Northwestern University, Evanston, IL
Elliot Parsons  Northwestern University, Evanston, IL
Yan Chen  Northwestern University, Evanston, IL
Sponsors
SIGCOMM: ACM Special Interest Group on Data Communication
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 39,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1028788.1028814
What is a DOI?

ABSTRACT

Traffic anomalies such as failures and attacks are increasing in frequency and severity, and thus identifying them rapidly and accurately is critical for large network operators. The detection typically treats the traffic as a collection of flows and looks for heavy changes in traffic patterns (<i>e.g.</i>, volume, number of connections). However, as link speeds and the number of flows increase, keeping per-flow state is not scalable. The recently proposed sketch-based schemes [14] are among the very few that can detect heavy changes and anomalies over massive data streams at network traffic speeds. However, sketches do not preserve the key (<i>e.g.</i>, source IP address) of the flows. Hence, even if anomalies are detected, it is difficult to infer the culprit flows, making it a big practical hurdle for online deployment. Meanwhile, the number of keys is too large to record.

To address this challenge, we propose efficient <i>reversible hashing</i> algorithms to infer the keys of culprit flows from sketches without storing any explicit key information. No extra memory or memory accesses are needed for recording the streaming data. Meanwhile, the heavy change detection daemon runs in the background with space complexity and computational time sublinear to the key space size. This short paper describes the conceptual framework of the reversible sketches, as well as some initial approaches for implementation. See [23] for the optimized algorithms in details. comment We further apply various emph IP-mangling algorithms and emph bucket classification methods to reduce the false positives and false negatives. Evaluated with netflow traffic traces of a large edge router, we demonstrate that the reverse hashing can quickly infer the keys of culprit flows even for many changes with high accuracy.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Cormode, G., et al.Finding hierarchical heavy hitters in data streams. In Proc. of VLDB (2003).
3
 
4
Cormode, G., and Muthukrishnan, S. Improved data stream summaries: The count-min sketch and its applications. Tech. Rep. 2003-20, DIMACS, 2003.
 
5
Cormode, G., and Muthukrishnan, S. What's new: Finding significant differences in network data streams. In Proc. of IEEE Infocom (2004).
6
7
8
 
9
 
10
Gilbert, A. C., et al. QuickSAND: Quick summary and analysis of network data. Tech. Rep. 2001-43, DIMACS, 2001.
 
11
Gilbert, A. C., Kotidis, Y., Muthukrishnan, S., and Strauss, M. J. Quick SAND: Quick summary and analysis of network data. Tech. Rep. 2001-43, DIMACS, 2001.
 
12
 
13
Hofmeyr, S., and Forrest, S. Intrusion detection using sequences of system calls. Journal of Computer Security 6 (1998).
14
 
15
Loyall, J. P., et al. Building adaptive and agile applications using intrusion detection and response. In Proc. of NDSS (2000).
 
16
Manku, G. S., and Motwani, R. Approximate frequency counts over data streams. In Proc. of IEEE VLDB (2002).
 
17
 
18
Moore, D., et al. The spread of the Sapphire/Slammer worm. IEEE Magazine on Security and Privacy (August 2003).
 
19
 
20
 
21
Roesch, M. Snort: The lightweight network intrusion detection system, 2001. http://www.snort.org/.
 
22
Ryutov, T., Neuman, C., Kim, D., and Zhou, L. Integrated access control and intrusion detection for web servers. IEEE Trans. on Parallel and Distributed Systems 14, 9 (2003), 841--850.
 
23
Schweller, R., Chen, Y., Parsons, E., Gupta, A., Memik, G., and Zhang, Y. Reverse hashing for sketch-based change detection on high-speed networks. Tech. Rep. NWU-CS-2004-45, Northwestern University, May 2004.
 
24
 
25
 
26
Weaver, N., Paxson, V., Staniford, S., and Cunningham, R. Large scale malicious code: A research agenda. Tech. Rep. DARPA-sponsored report, 2003.
 
27
Xilinx Inc. Virtex-II Pro and Virtex-II Pro X Platform FPGAs: Complete data sheet, 2004. www.xilinx.com/bvdocs/publications/ds083.pdf.


Collaborative Colleagues:
Robert Schweller: colleagues
Ashish Gupta: colleagues
Elliot Parsons: colleagues
Yan Chen: colleagues