首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Quantized Convolutional Neural Networks for Mobile Devices
【24h】

Quantized Convolutional Neural Networks for Mobile Devices

机译:移动设备的量化卷积神经网络

获取原文

摘要

Recently, convolutional neural networks (CNN) have demonstrated impressive performance in various computer vision tasks. However, high performance hardware is typically indispensable for the application of CNN models due to the high computation complexity, which prohibits their further extensions. In this paper, we propose an efficient framework, namely Quantized CNN, to simultaneously speed-up the computation and reduce the storage and memory overhead of CNN models. Both filter kernels in convolutional layers and weighting matrices in fully-connected layers are quantized, aiming at minimizing the estimation error of each layer's response. Extensive experiments on the ILSVRC-12 benchmark demonstrate 4 ~ 6× speed-up and 15 ~ 20× compression with merely one percentage loss of classification accuracy. With our quantized CNN model, even mobile devices can accurately classify images within one second.
机译:最近,卷积神经网络(CNN)在各种计算机视觉任务中已展示出令人印象深刻的性能。然而,由于高计算复杂性,高性能硬件对于CNN模型的应用通常是必不可少的,这阻碍了它们的进一步扩展。在本文中,我们提出了一个有效的框架,即量化CNN,以同时加快计算速度并减少CNN模型的存储和内存开销。卷积层中的两个滤波内核和全连接层中的加权矩阵都被量化,目的是使每层响应的估计误差最小。在ILSVRC-12基准上进行的大量实验表明,加速4到6倍,压缩15到20倍,而分类精度却只损失了一个百分点。借助我们的量化CNN模型,即使移动设备也可以在一秒钟内准确地对图像进行分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号