首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Quantized Convolutional Neural Networks for Mobile Devices
【24h】

Quantized Convolutional Neural Networks for Mobile Devices

机译:用于移动设备的量化卷积神经网络

获取原文

摘要

Recently, convolutional neural networks (CNN) have demonstrated impressive performance in various computer vision tasks. However, high performance hardware is typically indispensable for the application of CNN models due to the high computation complexity, which prohibits their further extensions. In this paper, we propose an efficient framework, namely Quantized CNN, to simultaneously speed-up the computation and reduce the storage and memory overhead of CNN models. Both filter kernels in convolutional layers and weighting matrices in fully-connected layers are quantized, aiming at minimizing the estimation error of each layer's response. Extensive experiments on the ILSVRC-12 benchmark demonstrate 4 ~ 6× speed-up and 15 ~ 20× compression with merely one percentage loss of classification accuracy. With our quantized CNN model, even mobile devices can accurately classify images within one second.
机译:最近,卷积神经网络(CNN)在各种计算机视觉任务中表现出令人印象深刻的性能。然而,由于高计算复杂性,高性能硬件通常对于CNN模型的应用是必不可少的,这禁止其进一步的扩展。在本文中,我们提出了一种有效的框架,即量化的CNN,同时加速计算并减少CNN模型的存储和存储器开销。量化卷积层中的滤波器内核和全连接层中的加权矩阵都被量化,旨在最小化每层响应的估计误差。在ILSVRC-12基准上的广泛实验表明4〜6倍加速和15〜20×压缩,仅限分类精度一百分比损失。利用我们量化的CNN模型,即使是移动设备也可以在一秒钟内准确地分类图像。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号