首页> 外国专利> METHOD AND ALGORITHM OF RECURSIVE DEEP LEARNING QUANTIZATION FOR WEIGHT BIT REDUCTION

METHOD AND ALGORITHM OF RECURSIVE DEEP LEARNING QUANTIZATION FOR WEIGHT BIT REDUCTION

机译:递归深度学习量化的权重降低方法与算法

摘要

A system for reducing weight storage bits for a deep-learning network and a method thereof include a quantization module and a cluster-number reduction module. The quantization module quantizes neural weights of each quantization layer in a deep-learning network. The cluster-number reduction module reduces a preset number of clusters for a layer having a clustering error which is the minimum value of clustering errors of a plurality of quantization layers. The quantization module performs re-quantization based on a preset number of clusters reduced with respect to the layer. The cluster-number reduction module further determines another layer having a clustering error, which is the minimum value of the clustering errors of the quantized layers. In addition, the cluster-number reduction module reduces the preset number of clusters with respect to another layer until the recognition performance of the deep-learning network is reduced by a preset threshold value.
机译:用于减少深度学习网络的权重存储位的系统及其方法,包括量化模块和簇数减少模块。量化模块在深度学习网络中量化每个量化层的神经权重。群集数目减少模块减少具有群集错误的层的预设群集数目,该群集错误是多个量化层的群集错误的最小值。量化模块基于相对于层减少的预设数目的簇执行重新量化。聚类数减少模块还确定具有聚类误差的另一层,该聚类误差是量化层的聚类误差的最小值。另外,簇数减少模块减少相对于另一层的预设簇数,直到深度学习网络的识别性能降低预设阈值为止。

著录项

  • 公开/公告号KR20180082344A

    专利类型

  • 公开/公告日2018-07-18

    原文格式PDF

  • 申请/专利权人 SAMSUNG ELECTRONICS CO. LTD.;

    申请/专利号KR20180002425

  • 发明设计人 JI ZHENGPING;BROTHERS WAKEFIELD JOHN;

    申请日2018-01-08

  • 分类号G06N3/08;G06N3/04;

  • 国家 KR

  • 入库时间 2022-08-21 12:39:29

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号