首页> 外文会议>ACM/IEEE Design Automation Conference >PattPIM: A Practical ReRAM-Based DNN Accelerator by Reusing Weight Pattern Repetitions
【24h】

PattPIM: A Practical ReRAM-Based DNN Accelerator by Reusing Weight Pattern Repetitions

机译:PattPIM:一种实用的基于ReRAM的DNN加速器,通过重用权重模式重复

获取原文

摘要

Weight sparsity has been explored to achieve energy efficiency for Resistive Random-access Memory (ReRAM) based DNN accelerators. However, most existing ReRAM-based DNN accelerators are based on an overidealized crossbar architecture and mainly focus on compressing zero weights. In this paper, we propose a novel ReRAM-based accelerator — PattPIM, to achieve space compression and computation reuse by studying DNN weight patterns based on practical ReRAM crossbars. We first thoroughly analyze the weight distribution characteristics of several typical DNN models and observe many non-zero weight pattern repetitions (WPRs). Thus, in PattPIM, we propose a WPR-aware DNN engine and a WPR-to-OU mapping scheme to save both space and computation resources. Furthermore, we adopt an approximate weight pattern transform algorithm to improve the DNN WPRs ratio to enhance the reuse efficiency with negligible inference accuracy loss. Our evaluation with 6 DNN models shows that the proposed PattPIM delivers significant performance improvement, ReRAM resources efficiency and energy saving.
机译:已经探索了重量稀疏性,以实现基于电阻式随机存取存储器(ReRAM)的DNN加速器的能效。但是,大多数现有的基于ReRAM的DNN加速器都基于过分理想的交叉开关体系结构,并且主要集中在压缩零权重上。在本文中,我们提出了一种新颖的基于ReRAM的加速器PattPIM,通过研究基于实际ReRAM交叉开关的DNN权重模式来实现空间压缩和计算重用。我们首先彻底分析几种典型DNN模型的权重分布特征,并观察许多非零权重模式重复(WPR)。因此,在PattPIM中,我们提出了一种WPR感知DNN引擎和WPR到OU映射方案,以节省空间和计算资源。此外,我们采用近似权重模式变换算法来提高DNN WPR比率,以提高重用效率,而推理精度损失可忽略不计。我们对6种DNN模型的评估表明,提出的PattPIM可以显着改善性能,ReRAM资源效率和节能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号