首页> 外文会议>2010 IEEE International Symposium on Parallel amp; Distributed Processing (IPDPS) >Scalable multi-pipeline architecture for high performance multi-pattern string matching
【24h】

Scalable multi-pipeline architecture for high performance multi-pattern string matching

机译:可扩展的多管道架构,用于高性能多模式字符串匹配

获取原文
获取原文并翻译 | 示例

摘要

Multi-pattern string matching remains a major performance bottleneck in network intrusion detection and anti-virus systems for high-speed deep packet inspection (DPI). Although Aho-Corasick deterministic finite automaton (AC-DFA) based solutions produce deterministic throughput and are widely used in today's DPI systems such as Snort [1] and ClamAV [2], the high memory requirement of AC-DFA (due to the large number of state transitions in AC-DFA) inhibits efficient hardware implementation to achieve high performance. Some recent work [3], [4] has shown that the AC-DFA can be reduced to a character trie that contains only the forward transitions by incorporating pipelined processing. But they have limitations in either handling long patterns or extensions to support multi-character input per clock cycle to achieve high throughput. This paper generalizes the problem and proves formally that a linear pipeline with H stages can remove all cross transitions to the top H levels of a AC-DFA. A novel and scalable pipeline architecture for memory-efficient multi-pattern string matching is then presented. The architecture can be easily extended to support multi-character input per clock cycle by mapping a compressed AC-DFA [5] onto multiple pipelines. Simulation using Snort and ClamAV pattern sets shows that a 8-stage pipeline can remove more than 99% of the transitions in the original AC-DFA. The implementation on a state-of-the-art field programmable gate array (FPGA) shows that our architecture can store on a single FPGA device the full set of string patterns from the latest Snort rule set. Our FPGA implementation sustains 10+ Gbps throughput, while consuming a small amount of on-chip logic resources. Also desirable scalability is achieved: the increase in resource requirement of our solution is sub-linear with the throughput improvement.
机译:多模式字符串匹配仍然是网络入侵检测和防病毒系统中用于高速深度数据包检查(DPI)的主要性能瓶颈。尽管基于Aho-Corasick确定性有限自动机(AC-DFA)的解决方案可以产生确定性的吞吐量,并已广泛应用于当今的DPI系统中,例如Snort [1]和ClamAV [2],但AC-DFA的内存要求很高(由于体积大, (AC-DFA中的状态转换数量)限制了有效的硬件实现以实现高性能。最近的一些工作[3],[4]表明,通过合并流水线处理,可以将AC-DFA简化为仅包含正向转换的字符特里。但是它们在处理长模式或扩展以在每个时钟周期支持多字符输入以实现高吞吐量方面存在局限性。本文对此问题进行了概括,并正式证明了具有H级的线性管线可以消除所有交叉过渡到AC-DFA的最高H级。然后提出了一种新颖且可扩展的流水线架构,用于内存高效的多模式字符串匹配。通过将压缩的AC-DFA [5]映射到多个流水线,可以轻松扩展该体系结构以支持每个时钟周期的多字符输入。使用Snort和ClamAV模式集进行的仿真显示,8级流水线可以消除原始AC-DFA中超过99%的过渡。最新的现场可编程门阵列(FPGA)的实现表明,我们的体系结构可以将来自最新Snort规则集的完整字符串模式集存储在单个FPGA器件上。我们的FPGA实现可维持10+ Gbps的吞吐量,同时消耗少量的片上逻辑资源。还可以实现理想的可伸缩性:我们解决方案的资源需求增加与吞吐量的提高呈线性关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号