首页> 外文会议>IEEE International Solid- State Circuits Conference >9.2 A 28nm 12.1TOPS/W Dual-Mode CNN Processor Using Effective-Weight-Based Convolution and Error-Compensation-Based Prediction
【24h】

9.2 A 28nm 12.1TOPS/W Dual-Mode CNN Processor Using Effective-Weight-Based Convolution and Error-Compensation-Based Prediction

机译:9.2使用基于有效权重的卷积和基于错误补偿的预测的28nm 12.1tops / W双模CNN处理器

获取原文

摘要

To deploy convolutional neural networks (CNNs) on edge devices efficiently, most existing CNN processors were built on quantized CNNs to optimize the inference operations. However, three issues (Fig. 9.2.1) have not been well addressed: 1) Duplicate weights in each kernel after quantization yielding repetitive multiplications; 2) a huge number of unnecessary MACs caused by ReLU activation functions; 3) frequent off-chip memory access in residual blocks.
机译:为了有效地部署在边缘设备上的卷积神经网络(CNNS),大多数现有的CNN处理器都构建在量化的CNN上,以优化推理操作。然而,三个问题(图9.2.1)尚未得到很好的解决:1)在量化重复乘法的量化之后每个内核中的重复重量; 2)由Relu激活功能引起的大量不必要的MAC; 3)频繁的残余块内的片外内存访问。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号