首页> 外国专利> Methods and systems for converting weights of a deep neural network from a first number format to a second number format

Methods and systems for converting weights of a deep neural network from a first number format to a second number format

机译:用于将深度神经网络的权重从第一数字格式转换为第二数字格式的方法和系统

摘要

Converting a plurality of weights of a filter of a Deep Neural Network (DNN) from a first to a second number format to enable the DNN to be implemented in hardware logic, the second format having less precision than the first. The conversion comprising determining, for each of the plurality of weights, a quantisation error associated with quantising that weight to the second format in accordance with a first quantisation method 402. The total quantisation error is determined for the plurality of weights based on said quantisation errors 404. A subset of the weights is identified to be quantised to the second format in accordance with a second quantisation method based on the total quantisation error 406. A set of quantised weights is generated, each weight in the subset based on quantising that weight to the second format in accordance with the second quantisation method and each of the remaining weights based on quantising that weight to the second format in accordance with the first quantisation method 408.
机译:将深度神经网络(DNN)的滤波器的多个权重从第一格式转换为第二数字格式,以使DNN能够以硬件逻辑实现,第二格式的精度低于第一格式。该转换包括针对多个权重中的每个权重,确定与根据第一量化方法402将该权重量化为第二格式相关联的量化误差。基于所述量化误差来确定多个权重的总量化误差。 404。基于总量化误差406,根据第二量化方法,将权重的子集识别为量化为第二格式。生成一组量化权重,子集中的每个权重基于将该权重量化为根据第二量化方法的第二格式,以及基于根据第一量化方法408将该权重量化为第二格式的剩余权重中的每一个。

著录项

  • 公开/公告号GB201912083D0

    专利类型

  • 公开/公告日2019-10-09

    原文格式PDF

  • 申请/专利权人 IMAGINATION TECHNOLOGIES LIMITED;

    申请/专利号GB20190012083

  • 发明设计人

    申请日2019-08-22

  • 分类号

  • 国家 GB

  • 入库时间 2022-08-21 11:43:21

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号