NEURAL NETWORK MODEL COMPRESSION METHOD AND APPARATUS, AND COMPUTER DEVICE

Abstract

The present application provides a neural network model compression method and apparatus, and a computer device. For each original convolution layer of the neural network model to be compressed, the method comprises: decomposing the original convolution layer into a plurality of cascaded target convolution layers; acquiring a first convolution processing result, produced when the original convolution layer convolves the input data, and a second convolution processing result, produced when the cascaded target convolution layers convolve the same input data in sequence; and correcting the weight matrices of the cascaded target convolution layers according to the first and second convolution processing results, so as to obtain the compressed weight matrices for the original convolution layer. A compressed neural network model is then obtained from the compressed weight matrices of the original convolution layers.
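The abstract does not say how a layer is decomposed or how the weights are corrected, so the following is only a minimal sketch under stated assumptions. It treats a 1x1 convolution as a matrix multiply over channels, uses a truncated SVD as the decomposition, and uses a least-squares refit of the second cascaded layer as the correction step. The function names (`decompose_layer`, `correct_weights`, `relative_error`) and the choice of SVD and least squares are illustrative and are not taken from the patent.

```python
import numpy as np


def decompose_layer(weight, rank):
    """Split a (c_out, c_in) 1x1-conv weight into two cascaded factors.

    Assumption: truncated SVD is used here only as one possible decomposition.
    """
    u, s, vt = np.linalg.svd(weight, full_matrices=False)
    root = np.sqrt(s[:rank])
    first = root[:, None] * vt[:rank]       # (rank, c_in): first cascaded layer
    second = u[:, :rank] * root[None, :]    # (c_out, rank): second cascaded layer
    return first, second


def correct_weights(weight, first, second, x):
    """Refit the second factor so the cascade matches the original layer on x.

    Assumption: the correction is a least-squares fit of the second layer only.
    The incoming `second` is replaced, so it is not used in the fit.
    """
    target = weight @ x    # first convolution processing result
    hidden = first @ x     # output of the first cascaded layer
    # Minimise ||second @ hidden - target||_F by solving hidden.T @ second.T = target.T
    second_t, *_ = np.linalg.lstsq(hidden.T, target.T, rcond=None)
    return first, second_t.T


def relative_error(weight, first, second, x):
    """Relative gap between the original output and the cascade output."""
    target = weight @ x
    approx = second @ (first @ x)   # second convolution processing result
    return np.linalg.norm(target - approx) / np.linalg.norm(target)


# Hypothetical layer sizes and calibration data, chosen for illustration.
rng = np.random.default_rng(0)
c_in, c_out, rank, n = 64, 32, 8, 500
weight = rng.standard_normal((c_out, c_in))
# Channels with unequal scales, so a data-aware correction has something to exploit.
x = np.linspace(0.1, 3.0, c_in)[:, None] * rng.standard_normal((c_in, n))

first, second = decompose_layer(weight, rank)
err_before = relative_error(weight, first, second, x)
first, second = correct_weights(weight, first, second, x)
err_after = relative_error(weight, first, second, x)

print("parameters:", weight.size, "->", first.size + second.size)
print("relative error before/after correction:", err_before, err_after)
```

In this sketch the two factors hold `rank * (c_in + c_out)` weights instead of `c_in * c_out`, which is where the compression comes from. Because the uncorrected second factor is itself a candidate solution of the least-squares problem, the corrected cascade cannot fit the calibration data worse than the uncorrected one.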