首页> 外国专利> Parametric Power-Of-2 Clipping Activations for Quantization for Convolutional Neural Networks

Parametric Power-Of-2 Clipping Activations for Quantization for Convolutional Neural Networks

机译:用于量化卷积神经网络的参数功率-2剪切激活

摘要

In described examples of a method for quantizing data for a convolutional neural network (CNN) is provided. A set of data is received and quantized the using a power-of-2 parametric activation (PACT2) function. The PACT2 function arranges the set of data as a histogram and discards a portion of the data corresponding to a tail of the histogram to form a remaining set of data. A clipping value is determined by expanding the remaining set of data to a nearest power of two value. The set of data is then quantized using the clipping value. With PACT2, a model can be quantized either using post training quantization or using quantization aware training. PACT2 helps a quantized model to achieve close accuracy compared to the corresponding floating-point model.
机译:在描述的描述中提供了用于量化神经网络(CNN)的量化数据的方法的示例。接收和量化使用2个电源 - 2个参数激活(PACT2)功能。 PACT2功能将数据集设置为直方图,并丢弃与直方图的尾部相对应的数据的一部分以形成剩余的数据集。通过将剩余的数据集扩展到最接近的两个值的最电量来确定剪切值。然后使用剪切值量化该组数据。对于PACT2,可以使用Post训练量化或使用量化意识培训来量化模型。与相应的浮点模型相比,PACT2有助于量化模型实现接近精度。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号