Redundancy-Reduced MobileNet Acceleration on Reconfigurable Logic for ImageNet Classification


Abstract

Modern Convolutional Neural Networks (CNNs) outperform many conventional feature-based computer vision algorithms in image classification and recognition on large-scale datasets such as ImageNet. However, the high computational complexity of CNN models can lead to low system performance in power-efficient applications. In this work, we first highlight two levels of model redundancy that exist widely in modern CNNs. We then use MobileNet as a design example and propose an efficient system design for a Redundancy-Reduced MobileNet (RR-MobileNet), in which off-chip memory traffic is used only for transferring inputs and outputs, while parameters and intermediate values are stored in on-chip BRAM blocks. Compared to AlexNet, our RR-MobileNet has 25× fewer parameters and 3.2× fewer operations per image inference, yet achieves 9%/5.2% higher Top-1/Top-5 classification accuracy on the ImageNet classification task. The latency of a single image inference is only 7.85 ms.
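The figures quoted in the abstract can be put into more familiar units with a little arithmetic. The sketch below is only illustrative: the function names are hypothetical, the AlexNet parameter count of roughly 61 million is an assumption that does not come from the abstract, and the throughput figure assumes images are processed one at a time with no overlap between inferences.

```python
# Back-of-the-envelope arithmetic on the numbers quoted in the abstract.

LATENCY_MS = 7.85        # single-image inference latency (from the abstract)
PARAM_REDUCTION = 25.0   # parameter reduction vs. AlexNet (from the abstract)

# Assumption, not stated in the abstract: AlexNet has roughly 61 million parameters.
ALEXNET_PARAMS = 61e6


def throughput_fps(latency_ms: float) -> float:
    """Images per second when inferences run strictly one after another."""
    return 1000.0 / latency_ms


def reduced_count(baseline: float, factor: float) -> float:
    """Size of a quantity after an 'N times fewer' reduction."""
    return baseline / factor


if __name__ == "__main__":
    fps = throughput_fps(LATENCY_MS)
    params = reduced_count(ALEXNET_PARAMS, PARAM_REDUCTION)
    print(f"Sequential throughput: {fps:.1f} images/s")
    print(f"Approximate parameter count: {params / 1e6:.2f} M")
```

Under these assumptions, a 7.85 ms latency corresponds to roughly 127 images per second, and a 25× reduction from about 61 million parameters leaves about 2.4 million, which gives a sense of why storing all parameters in on-chip memory becomes plausible.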


Bibliographic details

Source
International Symposium on Applied Reconfigurable Computing, 2018, pp. 16-28 (13 pages)
Authors
Jiang Su; Julian Faraone; Junyi Liu; Yiren Zhao; David B. Thomas; Philip H. W. Leong; Peter Y. K. Cheung
Original format: PDF
Keywords
Pruning; Quantization; CNN; FPGA Algorithm acceleration


Similar literature

1. Optical configuration acceleration on a new optically reconfigurable gate array very large scale integration using a negative logic implementation [J]. Retsu Moriwaki, Minoru Watanabe. Applied Optics, 2013, no. 9
2. Functions classification approach to generate reconfigurable fine-grain logic based on Ambipolar Independent Double Gate FET (Am-IDGFET) [J]. K. Jabeur, I. O'Connor, N. Yakymets. Microelectronics Journal, 2013, no. 12
3. AR and ARMA model order selection for time-series modeling with ImageNet classification [J]. Jihye Moon, Billal Hossain, Ki H. Chon. Signal Processing, 2021 (June)
4. Redundancy-Reduced MobileNet Acceleration on Reconfigurable Logic for ImageNet Classification [C]. Jiang Su, Julian Faraone, Junyi Liu, et al. International Symposium on Applied Reconfigurable Computing, 2018
5. ImageNet Classification with Complementary Networks [D]. Zhuotun Zhu. 2016
6. A New Image Classification Approach via Improved MobileNet Models with Local Receptive Field Expansion in Shallow Layers [O]. Wei Wang, Yiyang Hu, Ting Zou, et al. 2020
7. Video Processing Acceleration using Reconfigurable Logic and Graphics Processors [O]. Benjamin Thomas Cope. 2008
