ACM Home Page
Please provide us with feedback. Feedback
Curing regular expressions matching algorithms from insomnia, amnesia, and acalculia
Full text PdfPdf (372 KB)
Source
Symposium On Architecture For Networking And Communications Systems archive
Proceedings of the 3rd ACM/IEEE Symposium on Architecture for networking and communications systems table of contents
Orlando, Florida, USA
SESSION: Regular expressions & pattern matching table of contents
Pages 155-164  
Year of Publication: 2007
ISBN:978-1-59593-945-6
Authors
Sailesh Kumar  Washington University
Balakrishnan Chandrasekaran  Washington University
Jonathan Turner  Washington University
George Varghese  University of California, San Diego
Sponsors
SIGARCH: ACM Special Interest Group on Computer Architecture
ACM: Association for Computing Machinery
SIGCOMM: ACM Special Interest Group on Data Communication
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 89,   Citation Count: 7
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1323548.1323574
What is a DOI?

ABSTRACT

The importance of network security has grown tremendously and a collection of devices have been introduced, which can improve the security of a network. Network intrusion detection systems (NIDS) are among the most widely deployed such system; popular NIDS use a collection of signatures of known security threats and viruses, which are used to scan each packet's payload. Today, signatures are often specified as regular expressions; thus the core of the NIDS comprises of a regular expressions parser; such parsers are traditionally implemented as finite automata. Deterministic Finite Automata (DFA) are fast, therefore they are often desirable at high network link rates. DFA for the signatures, which are used in the current security devices, however require prohibitive amounts of memory, which limits their practical use.

In this paper, we argue that the traditional DFA based NIDS has three main limitations: first they fail to exploit the fact that normal data streams rarely match any virus signature; second, DFAs are extremely inefficient in following multiple partially matching signatures and explodes in size, and third, finite automaton are incapable of efficiently keeping track of counts. We propose mechanisms to solve each of these drawbacks and demonstrate that our solutions can implement a NIDS much more securely and economically, and at the same time substantially improve the packet throughput.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
J. Hopcroft, "An nlogn algorithm for minimizing states in a finite automaton," in Theory of Machines and Computation, J. Kohavi, Ed. New York: Academic, 1971, pp. 189--196.
 
4
Bro: A System for Detecting Network Intruders in Real-Time. http://www.icir.org/vern/bro-info.html
 
5
 
6
S. Antonatos, et. al, "Generating realistic workloads for network intrusion detection systems," In ACM Workshop on S & P, 2004.
7
 
8
 
9
S. Wu, U. Manber, "A fast algorithm for multi-pattern searching," Tech. R. TR-94-17, Dept. of Comp. Science, Univ of Arizona, 1994.
 
10
Fang Yu, et al., "Fast and Memory-Efficient Regular Expression Matching for Deep Packet Inspection", UCB tech. report, 2005.
 
11
N. Tuck, T. Sherwood, B. Calder, and G. Varghese, "Deterministic memory-efficient string matching algorithms for intrusion detection," IEEE Infocom 2004, pp. 333--340.
12
 
13
 
14
S. Yusuf and W. Luk, "Bitwise Optimised CAM for Network Intrusion Detection Systems," IEEE FPL 2005.
 
15
 
16
C. R. Clark and D. E. Schimmel, "Efficient reconfigurable logic circuit for matching complex network intrusion detection patterns," In Proceedings of 13th International Conference on Field Program.
 
17
18
 
19
Scott Tyler Shafer, Mark Jones, "Network edge courts apps," http://infoworld.com/article/02/05/27/020527newebdev_1.html
 
20
TippingPoint X505, www.tippingpoint.com/products_ips.html
 
21
Cisco IOS IPS Deployment Guide, www.cisco.com
 
22
Tarari RegEx, www. tarari.com/PDF/RegEx_FACT_SHEET.pdf
 
23
N. J. Larsson, "Structures of string matching and data compression," PhD thesis, Dept. of Computer Science, Lund University, 1999.
 
24
S. Dharmapurikar, P. Krishnamurthy, T. Sproull, and J. Lockwood, "Deep Packet Inspection using Parallel Bloom Filters," IEEE Hot Interconnects 12, August 2003. IEEE Computer Society Press.
 
25
Z. K. Baker, V. K. Prasanna, "Automatic Synthesis of Efficient Intrusion Detection Systems on FPGAs," in Field Prog. Logic and Applications, Aug. 2004, pp. 311--321.
 
26
 
27
 
28
J. Levandoski, E. Sommer, and M. Strait, "Application Layer Packet Classifier for Linux". http://l7-filter.sourceforge.net/.
 
29
"MIT DARPA Intrusion Detection Data Sets," http://www. ll.mit.edu/IST/ideval/data/2000/2000_data_index.html.
 
30
Vern Paxson et al., "Flex: A fast scanner generator".
 
31
SafeXcel Content Inspection Engine, hardware regex acceleration IP.
 
32
Network Services Processor, OCTEON CN31XX, CN30XX Family.
 
33
Will Eatherton, John Williams, "An encoded version of reg-ex database from cisco systems provided for research purposes".
34
 
35
S. Kumar, B. Chandrasekaran, J. Turner, and G. Varghese, "Curing Regular Expressions Matching Algorithms from Insomnia, Amnesia, and Acalculia", Washington University technical report, 2006.

CITED BY  8

Collaborative Colleagues:
Sailesh Kumar: colleagues
Balakrishnan Chandrasekaran: colleagues
Jonathan Turner: colleagues
George Varghese: colleagues