首页> 外文会议>2019 56th ACM/IEEE Design Automation Conference >Peregrine: A FIexible Hardware Accelerator for LSTM with Limited Synaptic Connection Patterns
【24h】

Peregrine: A FIexible Hardware Accelerator for LSTM with Limited Synaptic Connection Patterns

机译:百富勤:具有有限突触连接模式的LSTM柔性硬件加速器

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we present an integrated solution to design a high-performance LSTM accelerator. We propose a fast and flexible hardware architecture, named Peregrine, supported by a stack of innovations from algorithm to hardware design. Peregrine first minimizes the memory footprint by limiting the synaptic connection patterns within the LSTM network. Also, Peregrine provides parallel Huffman decoders with adaptive clocking to provide flexibility in dealing with a wide range of sparsity levels in the weight matrices. All these features are incorporated in a novel hardware architecture to maximize energy-efficiency. As a result, Peregrine improves performance by $sim$ 38% and energy-efficiency by $sim$ 33% in speech recognition compared to the state-of-the-art LSTM accelerator.
机译:在本文中,我们提出了一种集成解决方案,用于设计高性能LSTM加速器。我们提出了一种快速灵活的硬件架构,称为Peregrine,并得到了从算法到硬件设计的一系列创新支持。百富勤首先通过限制LSTM网络内的突触连接模式来最大程度地减少内存占用。另外,Peregrine还为并行的Huffman解码器提供了自适应时钟,以在处理权重矩阵的各种稀疏性级别时提供灵活性。所有这些功能都集成在新颖的硬件体系结构中,以最大限度地提高能效。结果,与最新的LSTM加速器相比,Peregrine在语音识别方面的性能提高了38%,而能源效率提高了33%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号