首页> 外国专利> NEURAL NETWORK COMPUTATION ACCELERATION METHOD AND SYSTEM BASED ON NON-UNIFORM QUANTIZATION AND LOOK-UP TABLE

NEURAL NETWORK COMPUTATION ACCELERATION METHOD AND SYSTEM BASED ON NON-UNIFORM QUANTIZATION AND LOOK-UP TABLE

机译:基于非均匀量化和查找表的神经网络计算加速方法及系统

摘要

A neural network computation acceleration method and system based on non-uniform quantization and a look-up table. The method (300) comprises: performing non-uniform quantization on parameters of each layer of a neural network (S310); performing non-uniform quantization on inputs of each layer of the neural network (S320); constructing a look-up table for each layer by multiplying each of the quantization values of the parameters of said layer by each of the quantization values of the inputs of said layer (S330); and when forward computation of the neural network is to be performed, looking up a result of multiplication computation of the parameters and inputs of each layer in the look-up table of said layer, and performing computation layer by layer until all computation is done (S340). The method performs non-uniform quantization on all parameters and inputs of a neural network, and further adopts a look-up table to replace multiplication computation, thus accelerating computation of the neural network.
机译:基于非均匀量化和查找表的神经网络计算加速方法和系统。方法(300)包括:对神经网络的每一层的参数执行非均匀量化(S310);在神经网络的每一层的输入上执行非均匀量化(S320);通过将所述层的参数的每个量化值乘以所述层的输入的每个量化值,来构造用于每个层的查找表(S330);当要进行神经网络的正向计算时,在所述层的查找表中查找每个层的参数和输入的乘法计算结果,并逐层执行计算,直到完成所有计算为止( S340)。该方法对神经网络的所有参数和输入执行非均匀量化,并进一步采用查找表代替乘法计算,从而加速了神经网络的计算。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号