首页> 外文会议>International symposium on applied reconfigurable computing >Redundancy-Reduced MobileNet Acceleration on Reconfigurable Logic for ImageNet Classification
【24h】

Redundancy-Reduced MobileNet Acceleration on Reconfigurable Logic for ImageNet Classification

机译:用于ImageNet分类的可重构逻辑上减少冗余的MobileNet加速

获取原文

摘要

Modern Convolutional Neural Networks (CNNs) excel in image classification and recognition applications on large-scale datasets such as ImageNet, compared to many conventional feature-based computer vision algorithms. However, the high computational complexity of CNN models can lead to low system performance in power-efficient applications. In this work, we firstly highlight two levels of model redundancy which widely exist in modern CNNs. Additionally, we use MobileNet as a design example and propose an efficient system design for a Redundancy-Reduced MobileNet (RR-MobileNet) in which off-chip memory traffic is only used for inputs/outputs transfer while parameters and intermediate values are saved in on-chip BRAM blocks. Compared to AlexNet, our RR-mobileNet has 25× less parameters, 3.2× less operations per image inference but 9%/5.2% higher Topl/Top5 classification accuracy on ImageNet classification task. The latency of a single image inference is only 7.85 ms.
机译:与许多传统的基于特征的计算机视觉算法相比,现代卷积神经网络(CNN)在大型图像数据集(如ImageNet)的图像分类和识别应用中表现出色。但是,CNN模型的高计算复杂性会导致在节电应用中系统性能低下。在这项工作中,我们首先强调现代CNN中广泛存在的两个模型冗余级别。此外,我们将MobileNet用作设计示例,并为减少冗余的MobileNet(RR-MobileNet)提出了一种有效的系统设计,其中片外存储器流量仅用于输入/输出传输,而参数和中间值保存在其中。芯片的BRAM块。与AlexNet相比,我们的RR-mobileNet的参数减少了25倍,每个图像推断的操作减少了3.2倍,但是ImageNet分类任务的Topl / Top5分类准确度提高了9%/ 5.2%。单个图像推断的等待时间仅为7.85毫秒。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号