Scalable multi-pipeline architecture for high performance multi-pattern string matching

机译：可扩展的多管道架构，用于高性能多模式字符串匹配

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multi-pattern string matching remains a major performance bottleneck in network intrusion detection and anti-virus systems for high-speed deep packet inspection (DPI). Although Aho-Corasick deterministic finite automaton (AC-DFA) based solutions produce deterministic throughput and are widely used in today's DPI systems such as Snort [1] and ClamAV [2], the high memory requirement of AC-DFA (due to the large number of state transitions in AC-DFA) inhibits efficient hardware implementation to achieve high performance. Some recent work [3], [4] has shown that the AC-DFA can be reduced to a character trie that contains only the forward transitions by incorporating pipelined processing. But they have limitations in either handling long patterns or extensions to support multi-character input per clock cycle to achieve high throughput. This paper generalizes the problem and proves formally that a linear pipeline with H stages can remove all cross transitions to the top H levels of a AC-DFA. A novel and scalable pipeline architecture for memory-efficient multi-pattern string matching is then presented. The architecture can be easily extended to support multi-character input per clock cycle by mapping a compressed AC-DFA [5] onto multiple pipelines. Simulation using Snort and ClamAV pattern sets shows that a 8-stage pipeline can remove more than 99% of the transitions in the original AC-DFA. The implementation on a state-of-the-art field programmable gate array (FPGA) shows that our architecture can store on a single FPGA device the full set of string patterns from the latest Snort rule set. Our FPGA implementation sustains 10+ Gbps throughput, while consuming a small amount of on-chip logic resources. Also desirable scalability is achieved: the increase in resource requirement of our solution is sub-linear with the throughput improvement.

机译：多模式字符串匹配仍然是网络入侵检测和防病毒系统中用于高速深度数据包检查（DPI）的主要性能瓶颈。尽管基于Aho-Corasick确定性有限自动机（AC-DFA）的解决方案可以产生确定性的吞吐量，并已广泛应用于当今的DPI系统中，例如Snort [1]和ClamAV [2]，但AC-DFA的内存要求很高（由于体积大，（AC-DFA中的状态转换数量）限制了有效的硬件实现以实现高性能。最近的一些工作[3]，[4]表明，通过合并流水线处理，可以将AC-DFA简化为仅包含正向转换的字符特里。但是它们在处理长模式或扩展以在每个时钟周期支持多字符输入以实现高吞吐量方面存在局限性。本文对此问题进行了概括，并正式证明了具有H级的线性管线可以消除所有交叉过渡到AC-DFA的最高H级。然后提出了一种新颖且可扩展的流水线架构，用于内存高效的多模式字符串匹配。通过将压缩的AC-DFA [5]映射到多个流水线，可以轻松扩展该体系结构以支持每个时钟周期的多字符输入。使用Snort和ClamAV模式集进行的仿真显示，8级流水线可以消除原始AC-DFA中超过99％的过渡。最新的现场可编程门阵列（FPGA）的实现表明，我们的体系结构可以将来自最新Snort规则集的完整字符串模式集存储在单个FPGA器件上。我们的FPGA实现可维持10+ Gbps的吞吐量，同时消耗少量的片上逻辑资源。还可以实现理想的可伸缩性：我们解决方案的资源需求增加与吞吐量的提高呈线性关系。

著录项

来源
《2010 IEEE International Symposium on Parallel amp; Distributed Processing (IPDPS)》|2010年|p.1-12|共12页
会议地点 Atlanta GA(US)
作者
Weirong Jiang; Yang Y.-H.E.; Prasanna V.K.;
展开▼
作者单位

Ming Hsieh Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 TP311.133;
关键词
DFA; Deep packet inspection; FPGA; pipeline; string matching;

机译：DFA;深度数据包检查; FPGA;管道;字符串匹配;

相似文献

外文文献
中文文献
专利

1. Parallel Length-based Matching Architecture for High Throughput Multi-Pattern Matching [J] . WANG Xiaofei, HU Chengchen, TANG Yi, 电子学报：英文版 . 2012,第003期

机译：高吞吐量多模式匹配的基于长度的并行匹配架构
2. Multi-Pattern Matching for Dictionary Compressed Strings [J] . Chen Hou, Meng Zhang, Hengshan Yue, Sensor Letters: A Journal Dedicated to all Aspects of Sensors in Science, Engineering, and Medicine . 2014,第2期

机译：字典压缩字符串的多模式匹配
3. Efficient bit-parallel multi-patterns approximate string matching algorithms [J] . Rajesh Prasad, Anuj Kumar Sharma, Alok Singh, Scientific Research and Essays . 2011,第4期

机译：高效的位并行多模式近似字符串匹配算法
4. Scalable multi-pipeline architecture for high performance multi-pattern string matching [C] . Jiang Weirong, Yang Yi-Hua E., Prasanna Viktor K. 2010 IEEE International Symposium on Parallel amp; Distributed Processing (IPDPS) . 2010

机译：可扩展的多管道架构，用于高性能多模式字符串匹配
5. Multi-pattern string matching algorithms. [D] . Zha, Xinyan. 2010

机译：多模式字符串匹配算法。
6. RGCA: A Reliable GPU Cluster Architecture for Large-Scale Internet of Things Computing Based on Effective Performance-Energy Optimization [O] . Yuling Fang, Qingkui Chen, Neal N. Xiong, 2017

机译：RGCA：基于有效性能-能源优化的可靠的GPU集群架构用于大规模物联网计算
7. Scalable Multi-Pipeline Architecture for High Performance Multi-Pattern String Matching [O] . Weirong Jiang, Yi-hua E. Yang, Viktor K. Prasanna 2010

机译：高性能多模式字符串匹配的可扩展多管道体系结构
8. Experiments in Parallel Fingerprint Matching - Architectural Implications for Large Scale Fingerprint Matching Evaluation Systems [R] . Fillinger, A., Diduch, L., Hamchi, I., 2011

机译：并行指纹匹配实验 - 大规模指纹匹配评估系统的结构意义

Scalable multi-pipeline architecture for high performance multi-pattern string matching

摘要

著录项

相似文献

相关主题

期刊订阅