首页> 外国专利> Method and device for transforming CNN layers to optimize CNN parameter quantization to be used for mobile devices or compact networks with high precision via hardware optimization

Method and device for transforming CNN layers to optimize CNN parameter quantization to be used for mobile devices or compact networks with high precision via hardware optimization

机译:通过硬件优化对CNN层进行变换以优化CNN参数量化的方法和装置,以用于高精度的移动设备或紧凑型网络

摘要

There is provided a method for transforming convolutional layers of a CNN including m convolutional blocks to optimize CNN parameter quantization to be used for mobile devices, compact networks, and the like with high precision via hardware optimization. The method includes steps of: a computing device (a) generating k-th quantization loss values by referring to k-th initial weights of a k-th initial convolutional layer included in a k-th convolutional block, a (k−1)-th feature map outputted from the (k−1)-th convolutional block, and each of k-th scaling parameters; (b) determining each of k-th optimized scaling parameters by referring to the k-th quantization loss values; (c) generating a k-th scaling layer and a k-th inverse scaling layer by referring to the k-th optimized scaling parameters; and (d) transforming the k-th initial convolutional layer into a k-th integrated convolutional layer by using the k-th scaling layer and the (k−1)-th inverse scaling layer.
机译:提供了一种用于通过硬件优化来对包括m个卷积块的CNN的卷积层进行变换以优化用于移动设备,紧凑网络等的CNN参数量化的方法。该方法包括以下步骤:计算设备(a)通过参考包括在第k个卷积块中的第k个初始卷积层的第k个初始权重来生成第k个量化损失值,(k-1)从第(k-1)个卷积块输出的第th个特征图和第k个缩放参数中的每一个; (b)通过参考第k个量化损失值来确定第k个最优缩放参数; (c)通过参考第k个优化的缩放参数,生成第k个缩放层和第k个逆缩放层; (d)通过使用第k缩放比例层和第(k-1)逆缩放比例层,将第k初始卷积层转换为第k综合卷积层。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号