Trained Biased Number Representation for ReRAM-Based Neural Network Accelerators

Wang Weijia; Lin Bill

首页> 外文期刊>ACM Journal on Emerging Technologies in Computing Systems >Trained Biased Number Representation for ReRAM-Based Neural Network Accelerators

【24h】

Trained Biased Number Representation for ReRAM-Based Neural Network Accelerators

机译：基于Reram的神经网络加速器的训练有素的偏见数字表示

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recent works have demonstrated the promise of using resistive random access memory (ReRAM) to perform neural network computations in memory. In particular, ReRAM-based crossbar structures can perform matrix-vector multiplication directly in the analog domain, but the resolutions of ReRAM cells and digital/analog converters limit the precisions of inputs and weights that can be directly supported. Although convolutional neural networks (CNNs) can be trained with low-precision weights and activations, previous quantization approaches are either not amenable to ReRAM-based crossbar implementations or have poor accuracies when applied to deep CNNs on complex datasets. In this article, we propose a new CNN training and implementation approach that implements weights using a trained biased number representation, which can achieve near full-precision model accuracy with as little as 2-bit weights and 2-bit activations on the CIFAR datasets. The proposed approach is compatible with a ReRAM-based crossbar implementation. We also propose an activation-side coalescing technique that combines the steps of batch normalization, nonlinear activation, and quantization into a single stage that simply performs a clipped-rounding operation. Experiments demonstrate that our approach outperforms previous low-precision number representations for VGG-11, VGG-13, and VGG-19 models on both the CIFAR-10 and CIFAR-100 datasets.

机译：最近的作品已经证明了使用电阻随机存取存储器（RERAM）在内存中执行神经网络计算的承诺。特别地，基于reram的横杆结构可以直接在模拟域中执行矩阵矢量乘法，但RerAM单元和数字/模拟转换器的分辨率限制了可以直接支持的输入和权重的精确。尽管可以用低精度的重量和激活训练卷积神经网络（CNNS），但是先前的量化方法不适用于基于reram的横杆实施方式，或者在复杂数据集上应用于深度CNN时具有差的准确度。在本文中，我们提出了一种新的CNN培训和实施方法，实现使用训练有素的偏置数字表示来实现权重，这可以在CIFAR数据集中具有近2位权重和2位激活的近全精度模型精度。所提出的方法与基于reram的横杆实现兼容。我们还提出了一种激活侧聚结技术，该技术将批量归一化，非线性激活和量化的步骤结合到简单地执行剪裁操作的单个阶段。实验表明，我们的方法在CIFAR-10和CIFAR-100数据集中占据了VGG-11，VGG-13和VGG-19模型的先前低精度数表示。

著录项

来源
《ACM Journal on Emerging Technologies in Computing Systems》 |2019年第2期|共17页
作者
Wang Weijia; Lin Bill;
展开▼
作者单位

Univ Calif San Diego Dept Elect &

Comp Engn La Jolla CA 92093 USA;

Univ Calif San Diego Dept Elect &

Comp Engn La Jolla CA 92093 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Resistive Memory; convolutional neural networks; quantization; machine learning; processing-in-memory;

机译：电阻记忆;卷积神经网络;量化;机器学习;加工内存;

相似文献

外文文献
中文文献
专利

1. Trained Biased Number Representation for ReRAM-Based Neural Network Accelerators [J] . Wang Weijia, Lin Bill ACM Journal on Emerging Technologies in Computing Systems . 2019,第2期

机译：基于Reram的神经网络加速器的训练有素的偏见数字表示
2. A Reduced Architecture for ReRAM-Based Neural Network Accelerator and Its Software Stack [J] . Ji Yu, Liu Zixin, Zhang Youhui IEEE Transactions on Computers . 2021,第3期

机译：基于Reram的神经网络加速器及其软件堆栈的减少架构
3. PDE-NetGen 1.0: from symbolic partial differential equation (PDE) representations of physical processes to trainable neural network representations [J] . Pannekoucke Olivier, Fablet Ronan Geoscientific Model Development Discussions . 2020,第7期

机译：PDE-Netgen 1.0：从符号部分微分方程（PDE）物理流程表示到可训练的神经网络表示
4. A Quantized Training Framework for Robust and Accurate ReRAM-based Neural Network Accelerators [C] . Chenguang Zhang, Pingqiang Zhou Asia and South Pacific Design Automation Conference . 2021

机译：基于鲁棒和精确的基于RERAM的神经网络加速器的量化训练框架
5. FPGA-based Accelerators for Convolutional Neural Networks on Embedded Devices [D] . Perera Miro, Jordi. 2020

机译：基于FPGA的嵌入式设备卷积神经网络的加速器
6. A 181 GOPS AKAZE Accelerator Employing Discrete-Time Cellular Neural Networks for Real-Time Feature Extraction [O] . Guangli Jiang, Leibo Liu, Wenping Zhu, 2015

机译：采用离散时间细胞神经网络的181 GOPS AKAZE加速器用于实时特征提取
7. Extracting Finite State Representations from Recurrent Neural Networks trained on Chaotic Symbolic Sequences [O] . Peter Tino, Miroslav Köteles 1999

机译：从在混沌符号序列上训练的递归神经网络中提取有限状态表示

Trained Biased Number Representation for ReRAM-Based Neural Network Accelerators

摘要

著录项

相似文献

相关主题

期刊订阅