Efficient Implementation of Convolutional Neural Networks with End to End Integer-Only Dataflow

机译：具有端到端纯整数数据流的卷积神经网络的高效实现

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Linear INT8 quantization is presented to construct an end to end integer-only dataflow for efficient inference of modern CNNs. The INT8 method is implemented with unified layer representation, thus quantized CNNs can be partitioned into computation subgraphs consisting of stacked unified layers with simplified integer-only arithmetic flow and scaling back mechanism, indicating high effectiveness for specific hardware realization. Experimental results show that both the classification and object detection models quantized by proposed INT8 method suffer approximate 1% accuracy loss, exhibiting comparable results with TensorRT. As a result, the deep learning accelerator (DLA) with integer-only dataflow and efficient memory hierarchy is designed for CNN applications.

机译：提出了线性INT8量化以构建端到端整数数据流，用于高效推论现代CNN。 INT8方法用统一的层表示来实现，因此量化的CNN可以被划分为由堆叠的统一层的计算子图，其具有简化的整数算术流程和缩放反馈机制，指示特定硬件实现的高效率。实验结果表明，通过提出的INT8方法量化的分类和对象检测模型均近似的1％的精度损耗，表现出与RENSORT的可比结果。因此，设计了具有整数DataFlow和有效内存层级的深度学习加速器（DLA）用于CNN应用。

著录项

来源
《IEEE International Conference on Multimedia and Expo》|2019年|1780-1785|共6页
会议地点
作者
Yiwu Yao; Bin Dong; Yuke Li; Weiqiang Yang; Haoqi Zhu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Quantization (signal); Computational modeling; Two dimensional displays; Convolution; Field programmable gate arrays; Training; Computational efficiency;

机译：量化（信号）;计算建模;二维显示;卷积;现场可编程门阵列;训练;计算效率;

相似文献

外文文献
中文文献
专利

1. Eyeriss: A Spatial Architecture for Energy-Efficient Dataflow for Convolutional Neural Networks [J] . Yu-Hsin Chen, Joel Emer, Vivienne Sze Computer architecture news . 2016,第3期

机译：Eyeriss：卷积神经网络的节能数据流的空间架构
2. An Energy-Efficient Deep Convolutional Neural Network Inference Processor With Enhanced Output Stationary Dataflow in 65-nm CMOS [J] . IEEE transactions on very large scale integration (VLSI) systems . 2020,第1期

机译：节能型深度卷积神经网络推理处理器，具有增强的65nm CMOS输出固定数据流
3. Efficient implementation of convolutional neural networks in the data processing of two-photon in vivo imaging [J] . Wang Yangzhen, Su Feng, Wang Shanshan, Bioinformatics . 2019,第17期

机译：高效地实现卷积神经网络在体内成像中双光子的数据处理
4. EFFICIENT IMPLEMENTATION OF CONVOLUTIONAL NEURAL NETWORKS WITH END TO END INTEGER-ONLY DATAFLOW [C] . Yiwu Yao, Bin Dong, Yuke Li, IEEE International Conference on Multimedia and Expo . 2019

机译：结束结尾的卷积神经网络的高效实现，以结束Integer only DataFlow
5. Efficient Execution of Convolutional Neural Networks on Low Powered Heterogeneous Systems [D] . Rodrigues, Crefeda Faviola. 2020

机译：高功率异构系统上有效地执行卷积神经网络
6. An Efficient Implementation of Deep Convolutional Neural Networks for MRI Segmentation [O] . Farnaz Hoseini, Asadollah Shahbahrami, Peyman Bayat 2018

机译：深度卷积神经网络用于MRI分割的有效实现
7. Eyeriss: A Spatial Architecture for Energy-Efficient Dataflow for Convolutional Neural Networks [O] . Yu-Hsin Chen, Joel Emer, Vivienne Sze 2017

机译：Eyeriss：用于卷积神经网络的节能数据流的空间架构

Efficient Implementation of Convolutional Neural Networks with End to End Integer-Only Dataflow

摘要

著录项

相似文献

相关主题

期刊订阅