Scalable multi-pipeline architecture for high performance multi-pattern string matching

机译：可扩展的多管线架构，用于高性能多模式字符串匹配

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multi-pattern string matching remains a major performance bottleneck in network intrusion detection and anti-virus systems for high-speed deep packet inspection (DPI). Although Aho-Corasick deterministic finite automaton (AC-DFA) based solutions produce deterministic throughput and are widely used in today's DPI systems such as Snort [1] and ClamAV [2], the high memory requirement of AC-DFA (due to the large number of state transitions in AC-DFA) inhibits efficient hardware implementation to achieve high performance. Some recent work [3], [4] has shown that the AC-DFA can be reduced to a character trie that contains only the forward transitions by incorporating pipelined processing. But they have limitations in either handling long patterns or extensions to support multi-character input per clock cycle to achieve high throughput. This paper generalizes the problem and proves formally that a linear pipeline with H stages can remove all cross transitions to the top H levels of a AC-DFA. A novel and scalable pipeline architecture for memory-efficient multi-pattern string matching is then presented. The architecture can be easily extended to support multi-character input per clock cycle by mapping a compressed AC-DFA [5] onto multiple pipelines. Simulation using Snort and ClamAV pattern sets shows that a 8-stage pipeline can remove more than 99% of the transitions in the original AC-DFA. The implementation on a state-of-the-art field programmable gate array (FPGA) shows that our architecture can store on a single FPGA device the full set of string patterns from the latest Snort rule set. Our FPGA implementation sustains 10+ Gbps throughput, while consuming a small amount of on-chip logic resources. Also desirable scalability is achieved: the increase in resource requirement of our solution is sub-linear with the throughput improvement.

机译：多模式字符串匹配仍然是网络入侵检测和防病毒系统中的主要性能瓶颈，用于高速深度分组检验（DPI）。虽然AHO-Corasick确定性有限自动机（AC-DFA）的解决方案产生确定性吞吐量，并且广泛用于当今的DPI系统，如Snort [1]和Clamav [2]，AC-DFA的高内存要求（由于大AC-DFA中的状态转换数量抑制了高效的硬件实现以实现高性能。最近的一些工作[3]，[4]表明，可以将AC-DFA减少到仅通过结合流水线处理而仅包含前向转换的字符TRIE。但它们有限制处理长度模式或扩展，以支持每个时钟周期的多字符输入以实现高吞吐量。本文概括了问题，并证明了具有H阶段的线性管道可以去除所有交叉过渡到AC-DFA的顶部H级别。然后呈现用于记忆有效的多模式字符串匹配的新颖和可扩展的管道架构。通过将压缩的AC-DFA [5]映射到多个管道上，可以轻松扩展架构以支持每个时钟周期的多字符输入。使用Snort和CLAMAV模式集进行仿真显示，8级流水线可以在原始AC-DFA中移除超过99％的过渡。最先进的字段可编程门阵列（FPGA）的实现表明，我们的体系结构可以存储在单个FPGA设备上，从最新的Snort规则集中存储全套字符串模式。我们的FPGA实施持续了10个以上的Gbps吞吐量，同时消耗少量的片上逻辑资源。也实现了理想的可扩展性：我们解决方案的资源需求的增加是具有吞吐量改进的子线性。

著录项

来源
《IEEE International Symposium on Parallel Distributed Processing》|2010年||共12页
会议地点
作者
Weirong Jiang; Yang Y.-H.E.; Prasanna V.K.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.138-53;
关键词
DFA; Deep packet inspection; FPGA; pipeline; string matching;

机译：DFA;深度包检查;FPGA;管道;串匹配;

相似文献

外文文献
中文文献
专利

1. Parallel Length-based Matching Architecture for High Throughput Multi-Pattern Matching [J] . WANG Xiaofei, HU Chengchen, TANG Yi, 电子学报：英文版 . 2012,第003期

机译：高吞吐量多模式匹配的基于长度的并行匹配架构
2. Multi-Pattern Matching for Dictionary Compressed Strings [J] . Chen Hou, Meng Zhang, Hengshan Yue, Sensor Letters: A Journal Dedicated to all Aspects of Sensors in Science, Engineering, and Medicine . 2014,第2期

机译：字典压缩字符串的多模式匹配
3. Efficient bit-parallel multi-patterns approximate string matching algorithms [J] . Rajesh Prasad, Anuj Kumar Sharma, Alok Singh, Scientific Research and Essays . 2011,第4期

机译：高效的位并行多模式近似字符串匹配算法
4. Scalable multi-pipeline architecture for high performance multi-pattern string matching [C] . Weirong Jiang, Yang Y.-H.E., Prasanna V.K. 2010 IEEE International Symposium on Parallel amp; Distributed Processing (IPDPS) . 2010

机译：可扩展的多管道架构，用于高性能多模式字符串匹配
5. Multi-pattern string matching algorithms. [D] . Zha, Xinyan. 2010

机译：多模式字符串匹配算法。
6. RGCA: A Reliable GPU Cluster Architecture for Large-Scale Internet of Things Computing Based on Effective Performance-Energy Optimization [O] . Yuling Fang, Qingkui Chen, Neal N. Xiong, 2017

机译：RGCA：基于有效性能-能源优化的可靠的GPU集群架构用于大规模物联网计算
7. Scalable Multi-Pipeline Architecture for High Performance Multi-Pattern String Matching [O] . Weirong Jiang, Yi-hua E. Yang, Viktor K. Prasanna 2010

机译：高性能多模式字符串匹配的可扩展多管道体系结构
8. Experiments in Parallel Fingerprint Matching - Architectural Implications for Large Scale Fingerprint Matching Evaluation Systems [R] . Fillinger, A., Diduch, L., Hamchi, I., 2011

机译：并行指纹匹配实验 - 大规模指纹匹配评估系统的结构意义

Scalable multi-pipeline architecture for high performance multi-pattern string matching

摘要

著录项

相似文献

相关主题

期刊订阅