首页> 外文会议>International symposium on applied reconfigurable computing >Redundancy-Reduced MobileNet Acceleration on Reconfigurable Logic for ImageNet Classification
【24h】

Redundancy-Reduced MobileNet Acceleration on Reconfigurable Logic for ImageNet Classification

机译:冗余减少的MobileNet在可重构逻辑上进行Imagenet分类的加速

获取原文
获取外文期刊封面目录资料

摘要

Modern Convolutional Neural Networks (CNNs) excel in image classification and recognition applications on large-scale datasets such as ImageNet, compared to many conventional feature-based computer vision algorithms. However, the high computational complexity of CNN models can lead to low system performance in power-efficient applications. In this work, we firstly highlight two levels of model redundancy which widely exist in modern CNNs. Additionally, we use MobileNet as a design example and propose an efficient system design for a Redundancy-Reduced MobileNet (RR-MobileNet) in which off-chip memory traffic is only used for inputs/outputs transfer while parameters and intermediate values are saved in on-chip BRAM blocks. Compared to AlexNet, our RR-mobileNet has 25× less parameters, 3.2× less operations per image inference but 9%/5.2% higher Topl/Top5 classification accuracy on ImageNet classification task. The latency of a single image inference is only 7.85 ms.
机译:与许多传统的基于特征的计算机视觉算法相比,现代卷积神经网络(CNNS)在图像分类和识别应用中的图像分类和识别应用中的识别应用。然而,CNN型号的高计算复杂性可能导致高功率高效应用中的系统性能。在这项工作中,我们首先突出了两级的模型冗余,在现代CNN中广泛存在。此外,我们使用MobileNet作为设计示例,并提出了一种有效的系统设计,用于冗余减少的MobileNet(RR-Mobilenet),其中片外存储器流量仅用于输入/输出传输,而参数和中间值保存在on上-Chip Bram块。与AlexNet相比,我们的RR-MobiLenet参数有25倍,每次图像推理的3.2倍较少,但Topenet分类任务上的TOPL / TOP5分类准确度高9%/ 5.2%。单个图像推断的延迟仅为7.85毫秒。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号