首页> 外文会议>International Conference on Field-Programmable Technology >A High Energy-Efficiency FPGA-Based LSTM Accelerator Architecture Design by Structured Pruning and Normalized Linear Quantization
【24h】

A High Energy-Efficiency FPGA-Based LSTM Accelerator Architecture Design by Structured Pruning and Normalized Linear Quantization

机译:基于结构化修剪和归一化线性量化的基于FPGA的高能效LSTM加速器架构设计

获取原文

摘要

LSTM (Long Short-Term Memory) is an artificial recurrent neural network (RNN) architecture and has been successfully applied to the areas where sequences of data need to be dealt with such as Natural Language Processing (NLP), speech recognition, etc. In this work, we explore an avenue to minimization of the LSTM inference part design based on FPGA for high performance and energy-efficiency. First, the model is pruned to create structured sparse features for the hardware-friendly purpose by using permuted block diagonal mask matrices. To further compress the model, we quantize the weights and activations following a normalized linear quantization approach. As a result, computational activities of the network are significantly deducted with an egligible loss on accuracy. Then a hardware architecture design has been devised to fully exploit the benefits of regular sparse structure. Having been implemented on Arria 10 (10AX115U4F45I3SG) FPGA running at 150 MHz, our accelerator demonstrates a peak performance of 2.22 TOPS at a power dissipation of 1.679 Watts. In comparison to the other FPGA-based LSTM accelerator designs previously reported, our approach achieves a 1.17-2.16x speedup in processing.
机译:LSTM(长期内存)是一个人工复发性神经网络(RNN)架构,并已成功应用于需要处理的数据序列,例如自然语言处理(NLP),语音识别等的区域这项工作,我们探索了基于FPGA的LSTM推理部件设计的途径,以实现高性能和能效。首先,通过使用允许的块对角线掩模矩阵,修剪模型以为硬件友好目的创建结构化稀疏功能。为了进一步压缩模型,我们通过归一化线性量化方法来量化权重和激活。因此,网络的计算活动得到了专利的准确性损失的显着扣除。然后,已经设计了硬件架构设计以充分利用常规稀疏结构的好处。在Arria 10(10AX115U4F45I3SG)FPGA以150 MHz运行,我们的加速器展示了1.679瓦的功耗的2.22顶部的峰值性能。与此前报道的其他基于FPGA的LSTM加速器设计相比,我们的方法在处理中实现了1.17-2.16倍的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号