Journal Article10.3390/app13148104
Accelerating Pattern Matching Using a Novel Multi-Pattern-Matching Algorithm on GPU
M. Çelebi,Uraz Yavanoglu +1 more
2
TL;DR: In this paper , a multi-pattern-matching algorithm was proposed to reduce the memory space and time required in the DPI pattern matching compared to traditional automaton-based algorithms with its ability to process more than one packet payload character at once.
read more
Abstract: Nowadays, almost all network traffic is encrypted. Attackers hide themselves using this traffic and attack over encrypted channels. Inspections performed only on packet headers and metadata are insufficient for detecting cyberattacks over encrypted channels. Therefore, it is important to analyze packet contents in applications that require control over payloads, such as content filtering, intrusion detection systems (IDSs), data loss prevention systems (DLPs), and fraud detection. This technology, known as deep packet inspection (DPI), provides full control over the communication between two end stations by keenly analyzing the network traffic. This study proposes a multi-pattern-matching algorithm that reduces the memory space and time required in the DPI pattern matching compared to traditional automaton-based algorithms with its ability to process more than one packet payload character at once. The pattern-matching process in the DPI system created to evaluate the performance of the proposed algorithm (PA) is conducted on the graphics processing unit (GPU), which accelerates the processing of network packets with its parallel computing capability. This study compares the PA with the Aho-Corasick (AC) and Wu–Manber (WM) algorithms, which are widely used in the pattern-matching process, considering the memory space required and throughput obtained. Algorithm tables created with a dataset containing 500 patterns use 425 and 688 times less memory space than those of the AC and WM algorithms, respectively. In the pattern-matching process using these tables, the PA is 3.5 and 1.5 times more efficient than the AC and WM algorithms, respectively.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Pattern Recognition for Identifying a String Within a Text File Using Finite Automata
Mamidi Prajana,Harsha Rajkumar,Akepati Sai Sannidhi,Kammari Vidyasri,Niharika Panda +4 more
- 24 Jun 2024
TL;DR: This study introduces a pattern recognition application using finite state machines, implemented with the SSFSM library in Python, to efficiently identify strings within text files, demonstrating effectiveness and scalability for large files and varying search key lengths.
1
Multi-Pattern GPU Accelerated Collision-Less Rabin-Karp for NIDS
A. Abbas,Mahmoud Fayez,Heba Khaled +2 more
TL;DR: Multi-pattern GPU accelerated collision-less Rabin-Karp for NIDS efficiently handles large-scale network traffic while resisting DOS attacks. The system utilizes six polynomial hashing functions to achieve scalability and accuracy.
References
Space/time trade-offs in hash coding with allowable errors
TL;DR: Analysis of the paradigm problem demonstrates that allowing a small number of test messages to be falsely identified as members of the given set will permit a much smaller hash area to be used without increasing reject time.
•Proceedings Article
Snort - Lightweight Intrusion Detection for Networks
Martin Roesch
- 12 Nov 1999
TL;DR: Snort provides a layer of defense which monitors network traffic for predefined suspicious activity or patterns, and alert system administrators when potential hostile traffic is detected.
Efficient string matching: an aid to bibliographic search
TL;DR: A simple, efficient algorithm to locate all occurrences of any of a finite number of keywords in a string of text that has been used to improve the speed of a library bibliographic search program by a factor of 5 to 10.
A fast string searching algorithm
TL;DR: The algorithm has the unusual property that, in most cases, not all of the first i .” in another string, are inspected.
Summary cache: a scalable wide-area web cache sharing protocol
TL;DR: This paper demonstrates the benefits of cache sharing, measures the overhead of the existing protocols, and proposes a new protocol called "summary cache", which reduces the number of intercache protocol messages, reduces the bandwidth consumption, and eliminates 30% to 95% of the protocol CPU overhead, all while maintaining almost the same cache hit ratios as ICP.
2.3K