IEEE International Symposium on High Performance Computer Architecture

Characterizing and Mitigating Output Reporting Bottlenecks in Spatial Automata Processing Architectures

Abstract

Automata processing has seen a resurgence in importance due to its usefulness for pattern matching and pattern mining of "big data." While large-scale automata processing is known to bottleneck von Neumann processors due to unpredictable memory accesses, spatial architectures excel at automata processing. Spatial architectures can implement automata graphs by wiring together automata states in reconfigurable arrays, allowing parallel automata state computation and point-to-point state transitions on-chip. However, spatial automata processing architectures can suffer from output constraints (up to 255x in commercial systems!) due to the physical placement of states, output processing architecture design, I/O resources, and the massively parallel nature of the architecture. To understand this bottleneck, we conduct the first known characterization of the output requirements of a realistic set of automata processing benchmarks. We find that most benchmarks report fairly frequently, but that few states report at any one time. This observation motivates new output compression schemes and reporting architectures. We evaluate the benefit of one purely software automata transformation and show that output reporting costs can be greatly reduced (improving performance by up to 40%) without hardware modification. We then explore bottlenecks in the reporting architecture of a commercial spatial automata processor and propose a new architecture that improves performance by up to 5.1x.
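
The observation that few states report in any one cycle can be illustrated with a rough cost comparison. The sketch below is not the paper's compression scheme or reporting architecture; it is a hypothetical model that compares emitting a full bit vector (one bit per reporting state) on every reporting cycle against emitting only the IDs of the states that fired. The function names, the trace, and the state count are all assumptions made for illustration, and per-cycle metadata such as cycle stamps is ignored.

```python
def dense_bits(reports_per_cycle, num_report_states):
    """Bits emitted if every cycle with at least one report sends a full
    bit vector, one bit per reporting state (hypothetical model)."""
    reporting_cycles = sum(1 for fired in reports_per_cycle if fired)
    return reporting_cycles * num_report_states


def sparse_bits(reports_per_cycle, num_report_states):
    """Bits emitted if each cycle sends only the IDs of the states that
    fired (hypothetical model; cycle stamps are ignored)."""
    # Bits needed to name one state out of num_report_states.
    id_bits = max(1, (num_report_states - 1).bit_length())
    return sum(len(fired) * id_bits for fired in reports_per_cycle)


# Hypothetical trace: the set of reporting-state IDs that fire each cycle.
trace = [set(), {3}, set(), {3, 700}, {12}, set()]
num_report_states = 1024  # assumed number of reporting states

dense = dense_bits(trace, num_report_states)
sparse = sparse_bits(trace, num_report_states)
print(f"dense: {dense} bits, sparse: {sparse} bits")
```

In this toy trace, reports occur on half of the cycles but at most two states fire at once, so the ID-based encoding is far smaller than the full vector. With denser reporting the comparison can reverse, which is why the characterization of real benchmarks matters.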