...
首页> 外文期刊>IEICE Electronics Express >An efficient ReRAM-based inference accelerator for convolutional neural networks via activation reuse
【24h】

An efficient ReRAM-based inference accelerator for convolutional neural networks via activation reuse

机译:通过激活重用的卷积神经网络的高效基于RERAM推理加速器

获取原文
           

摘要

In this paper, a novel resistive random access memory (ReRAM) based accelerator is proposed for convolution neural network (CNN) inference accelerations. In ReRAM-based CNN computation, weight parameters can be pre-programmed in ReRAM crossbar arrays, and activations are generated by processing the multiplication-and-accumulation (MAC) operations in the ReRAM crossbar arrays. However, prior works cannot reuse activations in computation, in which the activation dominates the data movements and raises significant energy cost. To deal with this dilemma, a tiling-based dataflow is proposed to enable activation reuse among adjacent ReRAM crossbar arrays to reduce the activation movements. We then develop a ReRAM-based CNN accelerator that can well suit the dataflow to reduce the cost of ReRAM access. Evaluation results show that the proposed design achieves 1.8× energy saving and 2.8× bandwidth saving compared with a state-of-the-art PipeLayer accelerator.
机译:本文提出了一种新的电阻随机存取存储器(RERAM)加速器,用于卷积神经网络(CNN)推理加速。在基于Reram的CNN计算中,可以在Reram横杆阵列中预先编程权重参数,并且通过在RerAM横杆阵列中处理乘法和累积(MAC)操作来生成激活。但是,先前的作品不能重用计算中的激活,其中激活主导数据移动并提高了显着的能量成本。为了处理这种困境,提出了一种TILINE的数据流,以便在相邻的RERAM横杆阵列之间启用激活重复使用,以减少激活运动。然后,我们开发了一个基于Reram的CNN加速器,可以很好地平整数据流以降低Reram访问的成本。评价结果表明,与最先进的管道加速器相比,该建议的设计达到1.8倍节能和2.8倍带宽节省。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号