The rising popularity of intelligent mobile devices and the daunting computational cost of deep learning-based models call for efficient and accurate on-device inference schemes. We propose a quantization scheme that allows inference to be carried out using integer-only arithmetic, which can be implemented more efficiently than floating point inference on commonly available integer-only hardware. We also co-design a training procedure to preserve end-to-end model accuracy post quantization. As a result, the proposed quantization scheme improves the tradeoff between accuracy and on-device latency. The improvements are significant even on MobileNets, a model family known for run-time efficiency, and are demonstrated in ImageNet classification and COCO detection on popular CPUs.
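To illustrate the integer-only arithmetic idea at a high level, the sketch below shows a common affine quantization mapping, r ≈ S · (q − Z), where real values are represented by 8-bit integers q together with a real scale S and an integer zero-point Z. This is a minimal illustrative sketch, not the paper's exact procedure; the function names and the per-tensor scale/zero-point choice are assumptions made for the example.

```python
import numpy as np

def quantize(r, scale, zero_point, qmin=0, qmax=255):
    """Hypothetical helper: map real values r to 8-bit integers via
    q = round(r / scale) + zero_point, clamped to [qmin, qmax]."""
    q = np.round(r / scale) + zero_point
    return np.clip(q, qmin, qmax).astype(np.uint8)

def dequantize(q, scale, zero_point):
    """Recover an approximation of the real values: r ≈ scale * (q - zero_point)."""
    return scale * (q.astype(np.int32) - zero_point)

# Example: quantize a small weight tensor and inspect the round-trip error.
w = np.array([-0.8, -0.1, 0.0, 0.4, 1.2], dtype=np.float32)
scale = (w.max() - w.min()) / 255.0          # assumed per-tensor scale
zero_point = int(round(-w.min() / scale))    # offset chosen so 0.0 maps to an exact integer
q = quantize(w, scale, zero_point)
print(q, dequantize(q, scale, zero_point))
```

Under a mapping like this, matrix multiplications in inference can be carried out on the integer values q, with the scales folded in afterward, which is what makes execution on integer-only hardware possible.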