Low Bit-Width Convolutional Neural Network on RRAM


Abstract

The emerging resistive random-access memory (RRAM) has been widely applied to accelerate the computation of deep neural networks. However, achieving high-precision computation on RRAM is challenging due to the limited number of resistance levels and the precision of the interfaces. Low bit-width convolutional neural networks (CNNs) offer a promising way to introduce low bit-width RRAM devices and low bit-width interfaces into RRAM-based computing systems (RCS). However, open questions remain: 1) how to split a weight matrix when a single crossbar is not large enough to hold all of its parameters; 2) how to design a pipeline based on a line buffer structure to accelerate inference; and 3) how to reduce the accuracy drop caused by parameter splitting and data quantization. In this paper, we propose an RRAM crossbar-based low bit-width CNN (LB-CNN) accelerator. We discuss the system design in detail, including matrix splitting strategies that enhance scalability and a pipelined implementation based on line buffers that accelerates inference. In addition, we propose a splitting-and-quantizing-while-training method that incorporates the actual hardware constraints into training. In our experiments, a low bit-width LeNet-5 on RRAM shows much better robustness to device variation than multibit models. The pipeline strategy achieves approximately 6.0x speedup per image on ResNet-18. For a low-bit VGG-8 on CIFAR-10, the proposed accelerator saves 54.9% of the energy consumption and 48.3% of the area compared with the multibit VGG-8 structure.
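The matrix splitting the abstract refers to can be illustrated with a minimal NumPy sketch: a weight matrix larger than one crossbar is tiled into fixed-size blocks, each tile performs one (here, idealized digital) matrix-vector product, and the partial sums along the input dimension are accumulated. The tile size of 128x128 and the function name are illustrative assumptions, not the paper's actual configuration.

```python
import numpy as np

def split_matmul(x, W, xbar_rows=128, xbar_cols=128):
    """Emulate a large matrix-vector product on fixed-size crossbars.

    W (in_dim x out_dim) is tiled into blocks of at most
    xbar_rows x xbar_cols; each tile stands in for one crossbar's
    analog MVM, and partial sums along the input dimension are
    accumulated digitally. Tile sizes here are illustrative only.
    """
    in_dim, out_dim = W.shape
    y = np.zeros(out_dim)
    for r in range(0, in_dim, xbar_rows):
        for c in range(0, out_dim, xbar_cols):
            tile = W[r:r + xbar_rows, c:c + xbar_cols]       # one crossbar's worth of weights
            y[c:c + xbar_cols] += x[r:r + xbar_rows] @ tile  # partial sum for this tile
    return y
```

Regardless of the tiling, the accumulated result equals the full matrix-vector product, which is why splitting affects accuracy only through the extra quantization and device non-idealities the paper addresses, not through the tiling itself.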
