首页> 外文会议>2018 IEEE 23rd International Conference on Digital Signal Processing >Energy-Efficient Architecture for FPGA-based Deep Convolutional Neural Networks with Binary Weights
【24h】

Energy-Efficient Architecture for FPGA-based Deep Convolutional Neural Networks with Binary Weights

机译:具有二进制权重的基于FPGA的深度卷积神经网络的节能架构

获取原文
获取原文并翻译 | 示例

摘要

This paper presents an energy-efficient, deep parallel Convolutional Neural Network (CNN) accelerator. By adopting a recently proposed binary weight method, the CNN computations are converted into multiplication-free processing. To allow parallel accessing and storing of data, we use two RAM banks, where each bank is composed of NRAM blocks corresponding to N-parallel processing. We also design a reconfigurable CNN computing unit in a divide-and-reuse to support a variable-size convolutional filter. Compared with full-precision computing on the MNIST and CIFAR-10 classification tasks, the inference Top-1 accuracy of the binary weight CNN has dropped by 1.21% and 1.34%, respectively. The hardware implementation results show that the proposed design can achieve 2100 GOPs with a 4.6 millisecond processing latency. The deep parallel accelerator exhibits 3X energy efficiency compared to a GPU-based design.
机译:本文提出了一种高能效,深度并行卷积神经网络(CNN)加速器。通过采用最近提出的二进制权重方法,CNN计算被转换为无乘法处理。为了允许并行访问和存储数据,我们使用两个RAM库,其中每个库由对应于N并行处理的NRAM块组成。我们还设计了一种可重配置的CNN计算单元,以支持重复使用,以支持可变大小的卷积滤波器。与针对MNIST和CIFAR-10分类任务的高精度计算相比,二元权重CNN的推理Top-1准确性分别降低了1.21%和1.34%。硬件实现结果表明,提出的设计可以实现2100 GOP,处理延迟为4.6毫秒。与基于GPU的设计相比,深度并行加速器具有3倍的能源效率。

著录项

  • 来源
  • 会议地点 Shanghai(CN)
  • 作者单位

    National key Lab. of Science and Technology on Communications, University of Electronic Science and Technology of China, Chengdu, 611731, China;

    National key Lab. of Science and Technology on Communications, University of Electronic Science and Technology of China, Chengdu, 611731, China;

    National key Lab. of Science and Technology on Communications, University of Electronic Science and Technology of China, Chengdu, 611731, China;

    National key Lab. of Science and Technology on Communications, University of Electronic Science and Technology of China, Chengdu, 611731, China;

    National key Lab. of Science and Technology on Communications, University of Electronic Science and Technology of China, Chengdu, 611731, China;

    University of Minnesota, Minneapolis, MN, 55455, USA;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Random access memory; Hardware; Registers; Kernel; Energy efficiency; Convolution; Field programmable gate arrays;

    机译:随机存取存储器;硬件;寄存器;内核;能效;卷积;现场可编程门阵列;;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号