【24h】

An Overlay Architecture for Pattern Matching

机译:模式匹配的叠加架构

获取原文

摘要

Deterministic and Non-deterministic Finite Automata (DFA and NFA) comprise the fundamental unit of work for many emerging big-data applications, motivating recent efforts to develop Domain-Specific architectures (DSAs) to exploit fine-grain parallelism available in automata workloads. In this paper we present NAPOLY (Non-Deterministic Automata Processor OverLaY), an overlay architecture and associated software that attempts to maximally exploit on-chip memory parallelism for NFA evaluation. In order to avoid an upper bound on NFA size that commonly affects prior efforts, NAPOLY is optimized for runtime reconfiguration, allowing for full reconfiguration in 10s of microseconds. NAPOLY is also parameterizable, allowing for offline generation of a repertoire of overlay configurations with various trade-offs between state capacity and transition capacity. In this paper we evaluate NAPOLY using our proposed state mapping heuristic and the ANMLZoo benchmark suite, and we compare NAPOLY's performance against existing CPU and GPU implementations. To the best of the authors' knowledge this is the first example of a runtime-reprogrammable FPGA-based automata processor overlay.
机译:确定性和非确定性有限自动机(DFA和NFA)构成了许多新兴大数据应用程序的基本工作单元,这激发了最近为开发域专用体系结构(DSA)来利用自动机工作负载中可用的细粒度并行性的努力。在本文中,我们介绍了NAPOLY(非确定性自动机处理器OverLaY),这是一个覆盖体系结构和相关软件,旨在最大程度地利用片上存储器并行性进行NFA评估。为了避免通常影响先前工作的NFA大小上限,NAPOLY已针对运行时重新配置进行了优化,从而允许在10微秒内进行完全重新配置。 NAPOLY也是可参数化的,允许离线生成覆盖配置的全部内容,并在状态容量和过渡容量之间进行各种折衷。在本文中,我们使用提议的状态映射试探法和ANMLZoo基准套件评估了NAPOLY,并且将NAPOLY的性能与现有的CPU和GPU实现进行了比较。据作者所知,这是基于运行时可重新编程的基于FPGA的自动机处理器覆盖的第一个示例。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号