首页> 外文会议>International Conference on Human Computer Interactions >A Summary of convolution Neural Network Compression and Acceleration Technology
【24h】

A Summary of convolution Neural Network Compression and Acceleration Technology

机译:卷积神经网络压缩与加速技术概述

获取原文

摘要

Although convolution neural network has achieved remarkable results in different application scenarios, there are a large number of parameters and computation in its structure, which limit its development in mobile and embedded devices. How to reduce parameters, compress model and optimize structure to improve network performance without losing accuracy has become a hot issue of convolution neural network. This paper summarizes and summarizes the convolution neural network structure optimization technology from five aspects: granularity pruning, weight quantization sharing, knowledge distillation, tensor decomposition and fine network design, and analyzes the technical core of it. Their advantages and disadvantages, applicable scenarios and optimization results are analyzed and summarized respectively, and the future research direction is prospected.
机译:虽然卷积神经网络在不同的应用场景中取得了显着的结果,但其结构中存在大量参数和计算,这限制了其在移动和嵌入式设备中的开发。 如何减少参数,压缩模型和优化结构,以提高网络性能而不会失去准确度已成为卷积神经网络的热门问题。 本文总结了五个方面的卷积神经网络结构优化技术:粒度修剪,重量量化共享,知识蒸馏,张量分解和精细网络设计,分析了它的技术核心。 分别分析并汇总了它们的优缺点,适用的情景和优化结果,未来的研究方向被展望。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号