...
首页> 外文期刊>ACM Journal on Emerging Technologies in Computing Systems >Trained Biased Number Representation for ReRAM-Based Neural Network Accelerators
【24h】

Trained Biased Number Representation for ReRAM-Based Neural Network Accelerators

机译:基于Reram的神经网络加速器的训练有素的偏见数字表示

获取原文
获取原文并翻译 | 示例
           

摘要

Recent works have demonstrated the promise of using resistive random access memory (ReRAM) to perform neural network computations in memory. In particular, ReRAM-based crossbar structures can perform matrix-vector multiplication directly in the analog domain, but the resolutions of ReRAM cells and digital/analog converters limit the precisions of inputs and weights that can be directly supported. Although convolutional neural networks (CNNs) can be trained with low-precision weights and activations, previous quantization approaches are either not amenable to ReRAM-based crossbar implementations or have poor accuracies when applied to deep CNNs on complex datasets. In this article, we propose a new CNN training and implementation approach that implements weights using a trained biased number representation, which can achieve near full-precision model accuracy with as little as 2-bit weights and 2-bit activations on the CIFAR datasets. The proposed approach is compatible with a ReRAM-based crossbar implementation. We also propose an activation-side coalescing technique that combines the steps of batch normalization, nonlinear activation, and quantization into a single stage that simply performs a clipped-rounding operation. Experiments demonstrate that our approach outperforms previous low-precision number representations for VGG-11, VGG-13, and VGG-19 models on both the CIFAR-10 and CIFAR-100 datasets.
机译:最近的作品已经证明了使用电阻随机存取存储器(RERAM)在内存中执行神经网络计算的承诺。特别地,基于reram的横杆结构可以直接在模拟域中执行矩阵矢量乘法,但RerAM单元和数字/模拟转换器的分辨率限制了可以直接支持的输入和权重的精确。尽管可以用低精度的重量和激活训练卷积神经网络(CNNS),但是先前的量化方法不适用于基于reram的横杆实施方式,或者在复杂数据集上应用于深度CNN时具有差的准确度。在本文中,我们提出了一种新的CNN培训和实施方法,实现使用训练有素的偏置数字表示来实现权重,这可以在CIFAR数据集中具有近2位权重和2位激活的近全精度模型精度。所提出的方法与基于reram的横杆实现兼容。我们还提出了一种激活侧聚结技术,该技术将批量归一化,非线性激活和量化的步骤结合到简单地执行剪裁操作的单个阶段。实验表明,我们的方法在CIFAR-10和CIFAR-100数据集中占据了VGG-11,VGG-13和VGG-19模型的先前低精度数表示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号