首页> 外文会议>IEEE International Symposium on Parallel Distributed Processing >Scalable multi-pipeline architecture for high performance multi-pattern string matching
【24h】

Scalable multi-pipeline architecture for high performance multi-pattern string matching

机译:可扩展的多管线架构,用于高性能多模式字符串匹配

获取原文

摘要

Multi-pattern string matching remains a major performance bottleneck in network intrusion detection and anti-virus systems for high-speed deep packet inspection (DPI). Although Aho-Corasick deterministic finite automaton (AC-DFA) based solutions produce deterministic throughput and are widely used in today's DPI systems such as Snort [1] and ClamAV [2], the high memory requirement of AC-DFA (due to the large number of state transitions in AC-DFA) inhibits efficient hardware implementation to achieve high performance. Some recent work [3], [4] has shown that the AC-DFA can be reduced to a character trie that contains only the forward transitions by incorporating pipelined processing. But they have limitations in either handling long patterns or extensions to support multi-character input per clock cycle to achieve high throughput. This paper generalizes the problem and proves formally that a linear pipeline with H stages can remove all cross transitions to the top H levels of a AC-DFA. A novel and scalable pipeline architecture for memory-efficient multi-pattern string matching is then presented. The architecture can be easily extended to support multi-character input per clock cycle by mapping a compressed AC-DFA [5] onto multiple pipelines. Simulation using Snort and ClamAV pattern sets shows that a 8-stage pipeline can remove more than 99% of the transitions in the original AC-DFA. The implementation on a state-of-the-art field programmable gate array (FPGA) shows that our architecture can store on a single FPGA device the full set of string patterns from the latest Snort rule set. Our FPGA implementation sustains 10+ Gbps throughput, while consuming a small amount of on-chip logic resources. Also desirable scalability is achieved: the increase in resource requirement of our solution is sub-linear with the throughput improvement.
机译:多模式字符串匹配仍然是网络入侵检测和防病毒系统中的主要性能瓶颈,用于高速深度分组检验(DPI)。虽然AHO-Corasick确定性有限自动机(AC-DFA)的解决方案产生确定性吞吐量,并且广泛用于当今的DPI系统,如Snort [1]和Clamav [2],AC-DFA的高内存要求(由于大AC-DFA中的状态转换数量抑制了高效的硬件实现以实现高性能。最近的一些工作[3],[4]表明,可以将AC-DFA减少到仅通过结合流水线处理而仅包含前向转换的字符TRIE。但它们有限制处理长度模式或扩展,以支持每个时钟周期的多字符输入以实现高吞吐量。本文概括了问题,并证明了具有H阶段的线性管道可以去除所有交叉过渡到AC-DFA的顶部H级别。然后呈现用于记忆有效的多模式字符串匹配的新颖和可扩展的管道架构。通过将压缩的AC-DFA [5]映射到多个管道上,可以轻松扩展架构以支持每个时钟周期的多字符输入。使用Snort和CLAMAV模式集进行仿真显示,8级流水线可以在原始AC-DFA中移除超过99%的过渡。最先进的字段可编程门阵列(FPGA)的实现表明,我们的体系结构可以存储在单个FPGA设备上,从最新的Snort规则集中存储全套字符串模式。我们的FPGA实施持续了10个以上的Gbps吞吐量,同时消耗少量的片上逻辑资源。也实现了理想的可扩展性:我们解决方案的资源需求的增加是具有吞吐量改进的子线性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号