【24h】

RED: A ReRAM-based Deconvolution Accelerator

机译:RED:基于ReRAM的解卷积加速器

获取原文

摘要

Deconvolution has been widespread in neural networks. For example, it is essential for performing unsupervised learning in generative adversarial networks or constructing fully convolutional networks for semantic segmentation. Resistive RAM (ReRAM)-based processing-in-memory architecture has been widely explored in accelerating convolutional computation and demonstrates good performance. Performing deconvolution on existing ReRAM-based accelerator designs, however, suffers from long latency and high energy consumption because deconvolutional computation includes not only convolution but also extra add-on operations. To realize the more efficient execution for deconvolution, we analyze its computation requirement and propose a ReRAM-based accelerator design, namely, RED. More specific, RED integrates two orthogonal methods, the pixel-wise mapping scheme for reducing redundancy caused by zero-inserting operations and the zero-skipping data flow for increasing the computation parallelism and therefore improving performance. Experimental evaluations show that compared to the state-of- the-art ReRAM-based accelerator, RED can speed up operation 3.69~31.15× and reduce 8%~88.36% energy consumption.
机译:反卷积已在神经网络中广泛传播。例如,对于在生成对抗网络中执行无监督学习或构建用于语义分割的完全卷积网络而言,这是必不可少的。基于电阻RAM(ReRAM)的内存中处理架构在加速卷积计算方面得到了广泛的探索,并显示出良好的性能。但是,在现有的基于ReRAM的加速器设计上执行反卷积会带来较长的等待时间和高能耗,因为反卷积计算不仅包括卷积,而且还包括额外的附加操作。为了实现反卷积的更有效执行,我们分析了其计算需求,并提出了基于ReRAM的加速器设计,即RED。更具体地说,RED集成了两种正交方法,一种是逐像素映射方案,用于减少由零插入操作引起的冗余;另一种是零跳跃数据流,用于提高计算并行度,从而提高性能。实验评估表明,与最先进的基于ReRAM的加速器相比,RED可以加速运行3.69〜31.15x,并减少8%〜88.36%的能耗。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号