首页> 外国专利> A SYSTEM AND A METHOD FOR NON-UNIFORM QUANTIZATION OF PRE-TRAINED DEEP NEURAL NETWORK

A SYSTEM AND A METHOD FOR NON-UNIFORM QUANTIZATION OF PRE-TRAINED DEEP NEURAL NETWORK

机译:预训练深层神经网络非均匀量化的系统和方法

摘要

The present invention relates to a system and a method for quantizing a pre-trained neural network. The method for quantizing a pre-trained neural network comprises: determining minimum quantization noise for a layer or channel for each master bit-width value in a predetermined set of master bit-width values, by a layer/channel bit-width determiner for a layer or channel of each of pre-trained neural networks; and selecting the master bit-width value having the minimum quantization noise for the layer or channel, by a bit-width selector for the layer or channel. In an embodiment of the present invention, the minimum quantization noise is based on multiplying the square of a range of weights for the layer or channel by a constant that is a negative exponent of a current master bit-width value.
机译:本发明涉及一种用于量化预训练神经网络的系统和方法。用于量化预训练的神经网络的方法包括:通过层/通道位宽确定器确定用于层或通道的最小量化噪声,该层或通道针对预定的一组主位宽值中的每个主位宽值。每个预训练神经网络的层或通道;通过层或通道的位宽选择器选择具有最小量化噪声的层或通道的主位宽值。在本发明的实施例中,最小量化噪声基于层或信道的权重范围的平方乘以作为当前主位宽度值的负指数的常数。

著录项

  • 公开/公告号KR20200035198A

    专利类型

  • 公开/公告日2020-04-02

    原文格式PDF

  • 申请/专利权人 SAMSUNG ELECTRONICS CO. LTD.;

    申请/专利号KR20190056689

  • 发明设计人 CHEN HUI;OVSIANNIKOV ILIA;

    申请日2019-05-15

  • 分类号G06N3/063;G06N3/04;

  • 国家 KR

  • 入库时间 2022-08-21 11:07:29

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号