首页> 外文会议>ACM/IEEE Annual International Symposium on Computer Architecture >ISAAC: A Convolutional Neural Network Accelerator with In-Situ Analog Arithmetic in Crossbars
【24h】

ISAAC: A Convolutional Neural Network Accelerator with In-Situ Analog Arithmetic in Crossbars

机译:ISAAC:卷积神经网络加速器,跨越式模拟算术

获取原文

摘要

A number of recent efforts have attempted to design accelerators for popular machine learning algorithms, such as those involving convolutional and deep neural networks (CNNs and DNNs). These algorithms typically involve a large number of multiply-accumulate (dot-product) operations. A recent project, DaDianNao, adopts a near data processing approach, where a specialized neural functional unit performs all the digital arithmetic operations and receives input weights from adjacent eDRAM banks. This work explores an in-situ processing approach, where memristor crossbar arrays not only store input weights, but are also used to perform dot-product operations in an analog manner. While the use of crossbar memory as an analog dot-product engine is well known, no prior work has designed or characterized a full-fledged accelerator based on crossbars. In particular, our work makes the following contributions: (i) We design a pipelined architecture, with some crossbars dedicated for each neural network layer, and eDRAM buffers that aggregate data between pipeline stages. (ii) We define new data encoding techniques that are amenable to analog computations and that can reduce the high overheads of analog-to-digital conversion (ADC). (iii) We define the many supporting digital components required in an analog CNN accelerator and carry out a design space exploration to identify the best balance of memristor storage/compute, ADCs, and eDRAM storage on a chip. On a suite of CNN and DNN workloads, the proposed ISAAC architecture yields improvements of 14.8×, 5.5×, and 7.5× in throughput, energy, and computational density (respectively), relative to the state-of-the-art DaDianNao architecture.
机译:最近的一些努力试图设计流行的机器学习算法的加速器,例如涉及卷积和深神经网络(CNN和DNN)的那些。这些算法通常涉及大量的乘积(DOT-Maply)操作。最近的一个项目Dadiannao采用近数据处理方法,其中专用神经功能单元执行所有数字算术运算并从相邻的EDRAM银行接收输入权重。这项工作探讨了原位处理方法,其中Memristor CrossBar阵列不仅存储输入权重,而且还用于以模拟方式执行点产品操作。虽然使用横杆存储器作为模拟点 - 产品发动机,但众所周知,没有先前的工作设计或表征基于横杆的全叶加速器。特别是,我们的工作提出了以下贡献:(i)我们设计了一种流水线架构,其中一些跨栏专用于每个神经网络层,以及在管道阶段之间聚合数据的EDRAM缓冲区。 (ii)我们定义了适用于模拟计算的新数据编码技术,并且可以减少模数转换(ADC)的高开销。 (iii)我们定义了模拟CNN加速器中所需的许多支持数字组件,并执行设计空间探索,以识别芯片上的Memitror存储/计算,ADC和EDRAM存储的最佳平衡。在CNN和DNN工作负载的套件上,拟议的ISAAC架构在吞吐量,能量和计算密度(分别)相对于最先进的Dadiannao架构产生了14.8倍,5.5倍和7.5倍的提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号