首页> 外国专利> METHOD FOR AUTOMATIC HYBRID QUANTIZATION OF DEEP ARTIFICIAL NEURAL NETWORKS

METHOD FOR AUTOMATIC HYBRID QUANTIZATION OF DEEP ARTIFICIAL NEURAL NETWORKS

机译:深层人工神经网络自动混合量化的方法

摘要

A method includes, for each floating-point layer in a set of floating-point layers: calculating a set of input activations and a set of output activations of the floating-point layer; converting the floating-point layer to a low-bit-width layer; calculating a set of low-bit-width output activations based on the set of input activations; and calculating a per-layer deviation statistic of the low-bit-width layer. The method also includes ordering the set of low-bit-width layers based on the per-layer deviation statistic of each low-bit-width layer. The method additionally includes, while a loss-of-accuracy threshold exceeds the accuracy of the quantized network: converting a floating-point layer represented by the low-bit-width layer to a high-bit-width layer; replacing the low-bit-width layer with the high-bit-width layer in the quantized network; updating the accuracy of the quantized network; and, in response to the accuracy of the quantized network exceeding the loss-of-accuracy threshold, returning the quantized network.
机译:一种方法包括,对于一组浮点层中的每个浮点层:计算一组输入激活和浮点层的一组输出激活;将浮点层转换为低位宽度层;基于输入激活集计算一组低位宽度输出激活;并计算低位宽度层的每个层偏差统计。该方法还包括基于每个低位宽度层的每个层偏差统计来排序一组低位宽度层。该方法还包括,虽然精度损失阈值超过量化网络的精度:将由低位宽度层表示的浮点层转换为高比特宽度层;用量化网络中的高位宽度层替换低位宽度层;更新量化网络的准确性;并且,响应量化网络超过高精度阈值的准确性,返回量化网络。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号