Scalable multi-pipeline architecture for high performance multi-pattern string matching

Network flow classification is a key function in high-speed switches and routers, and it directly determines the performance of network devices. As the Internet and its applications develop, flow classification must support multi-dimensional fields and large rule sets while sustaining high throughput. Software-based classification cannot meet performance requirements as high as 100 Gbps. FPGA-based flow classification methods can achieve very high throughput, but range matching remains challenging. To address this, the paper proposes a range supported bit vector (RSBV) method. First, the characteristics of range matching are analyzed, and the rules are pre-encoded and stored in memory. Second, the fields of an input packet header are used as addresses to read the memory, and the result of range matching is derived through pipelined Boolean operations. On this basis, a bit vector for any type of field (AFBV) is further proposed, which efficiently supports flow classification for multi-dimensional fields, including exact matching, longest prefix matching, range matching, and arbitrary wildcard matching. The proposed methods are implemented on an FPGA platform. Through a two-dimensional pipeline architecture, AFBV can operate at a high clock frequency and achieve a processing speed of more than 100 Gbps. Simulation results show that for a rule set of 512-bit width and 1 k rules, AFBV achieves a throughput of 520 million packets per second (MPPS). Performance is improved by 44% compared with FSBV and 30% compared with Stride BV. Power consumption is reduced by about 43% compared with a TCAM solution.
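The general bit-vector idea described above (header fields used as memory addresses, per-field results combined with Boolean operations) can be sketched in software as follows. This is a minimal illustration and not the paper's RSBV or AFBV encoding: the 4-bit field width, the `Rule` representation as one inclusive range per field, and the function names `build_tables` and `classify` are all assumptions made for the example. It enumerates every value in a range, which is only practical for toy field widths.

```python
from typing import List, Tuple

FIELD_BITS = 4  # toy field width (assumption); real header fields are far wider

# One inclusive (lo, hi) range per field. An exact match is lo == hi and a
# wildcard is (0, 2**FIELD_BITS - 1). This representation is hypothetical.
Rule = Tuple[Tuple[int, int], ...]


def build_tables(rules: List[Rule]) -> List[List[int]]:
    """Pre-encode rules: tables[f][v] is a bit vector whose bit i is set
    when rule i accepts value v in field f."""
    num_fields = len(rules[0])
    tables = [[0] * (1 << FIELD_BITS) for _ in range(num_fields)]
    for rule_idx, rule in enumerate(rules):
        for f, (lo, hi) in enumerate(rule):
            for value in range(lo, hi + 1):
                tables[f][value] |= 1 << rule_idx
    return tables


def classify(tables: List[List[int]], header: Tuple[int, ...]) -> int:
    """Use each field value as an address, AND the bit vectors that are read,
    and return the lowest matching rule index (-1 if no rule matches)."""
    vec = -1  # all bits set
    for f, value in enumerate(header):
        vec &= tables[f][value]
    if vec == 0:
        return -1
    return (vec & -vec).bit_length() - 1  # index of the lowest set bit


rules = [
    ((3, 3), (0, 15)),   # rule 0: exact match on field 0, wildcard on field 1
    ((0, 7), (5, 9)),    # rule 1: range on both fields
    ((0, 15), (0, 15)),  # rule 2: catch-all
]
tables = build_tables(rules)
```

Here a lower rule index is treated as higher priority, which is a common convention but is also an assumption. In hardware the table reads and AND operations would be spread across pipeline stages rather than run in a loop.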

High-performance pipeline architecture for packet classification accelerator in DPU

10.1109/icfpt52863.2021.9609841 ◽

2021 ◽

Author(s):

Jing Tan ◽

GaoFeng Lv ◽

Yanni Ma ◽

GuanJie Qiao

Keyword(s):

High Performance ◽

Packet Classification ◽

Pipeline Architecture

High performance string matching algorithm for a network intrusion prevention system (NIPS)

2006 Workshop on High Performance Switching and Routing ◽

10.1109/hpsr.2006.1709697 ◽

2006 ◽

Cited By ~ 28

Author(s):

Y. Weinsberg ◽

S. Tzur-David ◽

D. Dolev ◽

T. Anker

Keyword(s):

High Performance ◽

String Matching ◽

Matching Algorithm ◽

Intrusion Prevention ◽

Network Intrusion ◽

Prevention System ◽

Intrusion Prevention System

Pipeline Implementation of Polyphase PSO for Adaptive Beamforming Algorithm

Wireless Communications and Mobile Computing ◽

10.1155/2017/3926821 ◽

2017 ◽

Vol 2017 ◽

pp. 1-12

Author(s):

Shaobing Huang ◽

Li Yu ◽

Fangjian Han ◽

Yiwen Luo

Keyword(s):

High Performance ◽

Dynamic Range ◽

Large Population ◽

Optimal Solution ◽

Pso Algorithm ◽

Adaptive Beamforming ◽

Pipeline Architecture ◽

Polyphase Filter ◽

Bank Structure ◽

Pipeline Implementation

Adaptive beamforming is a powerful anti-interference technique in which searching for and tracking optimal solutions is a great challenge. In this paper, a partial Particle Swarm Optimization (PSO) algorithm is proposed to track the optimal solution of an adaptive beamformer because of its strong global search capability. To exploit the algorithm's naturally parallel search, a novel Field Programmable Gate Array (FPGA) pipeline architecture using a polyphase filter bank structure is designed. To perform computations with large dynamic range and high precision, the proposed implementation uses an efficient user-defined floating-point arithmetic. In addition, a polyphase architecture is proposed to achieve a fully pipelined implementation. For PSO with a large population, the polyphase architecture can significantly save hardware resources while achieving high performance. Finally, simulation results obtained by cosimulation with ModelSim and SIMULINK are presented.

High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs

2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS) ◽

10.1109/icpads.2015.77 ◽

2015 ◽

Author(s):

Cheng-Hung Lin ◽

Chun-Cheng Huang

Keyword(s):

High Performance ◽

String Matching ◽

Approximate String Matching ◽

Location Aware

CUgrep: A GPU-based high performance multi-string matching system

2010 2nd International Conference on Future Computer and Communication ◽

10.1109/icfcc.2010.5497832 ◽

2010 ◽

Cited By ~ 3

Author(s):

Jiangfeng Peng ◽

Hu Chen

Keyword(s):

High Performance ◽

String Matching

Cache Locality-Centric Parallel String Matching on Many-Core Accelerator Chips

Scientific Programming ◽

10.1155/2015/937694 ◽

2015 ◽

Vol 2015 ◽

pp. 1-20 ◽

Cited By ~ 1

Author(s):

Nhat-Phuong Tran ◽

Myungho Lee ◽

Dong Hoon Choi

Keyword(s):

High Performance ◽

Parallel Implementation ◽

String Matching ◽

Processing Unit ◽

Xeon Phi ◽

Intel Xeon Phi ◽

Multiple Threads ◽

The Many ◽

Many Core ◽

Intel Xeon

The Aho-Corasick (AC) algorithm is a multi-pattern string matching algorithm commonly used in computer and network security and in bioinformatics, among many other fields. To meet the highly demanding computational requirements of these applications, achieving high performance for the AC algorithm is crucial. In this paper, we present a high performance parallelization of AC on many-core accelerator chips such as the Graphics Processing Unit (GPU) from Nvidia and the Intel Xeon Phi. Our parallelization approach significantly improves the cache locality of AC by partitioning a given set of string patterns into multiple smaller pattern sets in a space-efficient way. Using the multiple pattern sets, intensive pattern matching operations are conducted concurrently over the whole input text. Compared with previous approaches, in which the input data rather than the pattern set is partitioned among multiple threads, our approach significantly improves performance. Experimental results show that our approach yields up to a 2.73 times speedup on the Nvidia K20 GPU and a 2.00 times speedup on the Intel Xeon Phi compared with the previous approach. Our parallel implementation delivers up to 693 Gbps throughput on the K20.
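The pattern-partitioning idea in this abstract can be illustrated with a small sequential sketch: split the pattern set into several smaller sets, build one Aho-Corasick automaton per set, and scan the whole text with each automaton. The round-robin split and the names `build_automaton`, `search`, and `partitioned_search` are assumptions for illustration. The paper's space-efficient partitioning scheme and its GPU and Xeon Phi parallelization are not reproduced here.

```python
from collections import deque
from typing import Dict, List, Set, Tuple

Automaton = Tuple[List[Dict[str, int]], List[int], List[List[str]]]


def build_automaton(patterns: List[str]) -> Automaton:
    """Build goto, failure, and output tables for a set of patterns."""
    goto: List[Dict[str, int]] = [{}]
    fail: List[int] = [0]
    out: List[List[str]] = [[]]
    for p in patterns:  # insert each pattern into the trie
        state = 0
        for ch in p:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        out[state].append(p)
    queue = deque(goto[0].values())  # depth-1 states fail to the root
    while queue:  # breadth-first pass fills in failure links
        s = queue.popleft()
        for ch, t in goto[s].items():
            queue.append(t)
            f = fail[s]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[t] = goto[f].get(ch, 0)
            out[t] = out[t] + out[fail[t]]  # inherit matches ending here
    return goto, fail, out


def search(automaton: Automaton, text: str) -> Set[Tuple[int, str]]:
    """Return (start_index, pattern) for every occurrence in text."""
    goto, fail, out = automaton
    state = 0
    hits: Set[Tuple[int, str]] = set()
    for i, ch in enumerate(text):
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        for p in out[state]:
            hits.add((i - len(p) + 1, p))
    return hits


def partitioned_search(patterns: List[str], text: str,
                       num_sets: int) -> Set[Tuple[int, str]]:
    """Scan the whole text once per pattern subset and merge the hits."""
    hits: Set[Tuple[int, str]] = set()
    for k in range(num_sets):
        subset = patterns[k::num_sets]  # round-robin split (assumption)
        if subset:
            hits |= search(build_automaton(subset), text)
    return hits
```

Each smaller automaton has fewer states than one automaton over the full pattern set, which is the property the abstract links to better cache locality. In this sketch the per-subset scans run one after another, whereas the paper runs them concurrently.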

Scalable multi-pipeline architecture for high performance multi-pattern string matching

FASTRUN - A High Performance Computing Device for Molecular Mechanics Using a Pipeline Architecture

A high performance 5 stage pipeline architecture for the H.264/AVC deblocking filter

A High-Performance and Memory-Efficient Pipeline Architecture for the 5/3 and 9/7 Discrete wavelet Transform of JPEG2000 Codec

AFBV: A High-Performance Network Flow Classification Method for Multi-Dimensional Fields and FPGA Implementation

High-performance pipeline architecture for packet classification accelerator in DPU

High performance string matching algorithm for a network intrusion prevention system (NIPS)

Pipeline Implementation of Polyphase PSO for Adaptive Beamforming Algorithm

High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs

CUgrep: A GPU-based high performance multi-string matching system

Cache Locality-Centric Parallel String Matching on Many-Core Accelerator Chips
