【24h】

Cache Automaton

机译:缓存自动化

获取原文

摘要

Finite State Automata are widely used to accelerate pattern matching in many emerging application domains like DNA sequencing and XML parsing. Conventional CPUs and compute-centric accelerators are bottlenecked by memory bandwidth and irregular memory access patterns in automata processing. We present Cache Automaton, which repurposes last-level cache for automata processing, and a compiler that automates the process of mapping large real world Non-Deterministic Finite Automata (NFAs) to the proposed architecture. Cache Automaton extends a conventional last-level cache architecture with components to accelerate two phases in NFA processing: state-match and state-transition. State-matching is made efficient using a sense-amplifier cycling technique that exploits spatial locality in symbol matches. State-transition is made efficient using a new compact switch architecture. By overlapping these two phases for adjacent symbols we realize an efficient pipelined design. We evaluate two designs, one optimized for performance and the other optimized for space, across a set of 20 diverse benchmarks. The performance optimized design provides a speedup of 15× over DRAM-based Micron's Automata Processor and 3840× speedup over processing in a conventional x86 CPU. The proposed design utilizes on an average 1.2 MB of cache space across benchmarks, while consuming 2.3 nJ of energy per input symbol. Our space optimized design can reduce the cache utilization to 0.72 MB, while still providing a speedup of 9× over AP.
机译:有限状态自动机广泛用于加速许多新兴应用结构域中的模式匹配,如DNA测序和XML解析。传统的CPU和Compute的加速器是通过存储器带宽和自动机处理中的不规则内存访问模式的瓶颈。我们提出了缓存自动机,该机构修复了自动数据处理的最后级别缓存,以及自动映射大型现实世界非确定性有限自动机(NFAS)的过程到所提出的体系结构的编译器。 Cache Automaton与组件扩展了传统的最后一级缓存架构,以加速NFA处理中的两个阶段:状态匹配和状态转换。使用感测放大器循环技术进行状态匹配,该技术利用符号匹配中的空间局部性。使用新的紧凑型开关架构进行了高效的状态转换。通过重叠这两个相对于相邻符号的阶段,我们实现了一种有效的流水线设计。我们评估了两种设计,针对性能优化,另一组用于空间优化,跨越一组20个不同的基准。性能优化的设计在传统的X86 CPU中提供了基于DRAM的Micron的自动机处理器和3840×加速度的加速15×。所提出的设计利用平均基准的1.2 MB的缓存空间,同时每次输入符号消耗2.3 NJ的能量。我们的空间优化设计可以将高速缓存利用率降低到0.72 MB,同时仍然在AP上提供9倍的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号