首页> 外国专利> NEURAL NETWORK COMPUTATION ACCELERATION METHOD AND SYSTEM BASED ON NON-UNIFORM QUANTIZATION AND LOOK-UP TABLE

NEURAL NETWORK COMPUTATION ACCELERATION METHOD AND SYSTEM BASED ON NON-UNIFORM QUANTIZATION AND LOOK-UP TABLE

机译：基于非均匀量化和查找表的神经网络计算加速方法及系统

页面导航

摘要
著录项
相似文献

摘要

A neural network computation acceleration method and system based on non-uniform quantization and a look-up table. The method (300) comprises: performing non-uniform quantization on parameters of each layer of a neural network (S310); performing non-uniform quantization on inputs of each layer of the neural network (S320); constructing a look-up table for each layer by multiplying each of the quantization values of the parameters of said layer by each of the quantization values of the inputs of said layer (S330); and when forward computation of the neural network is to be performed, looking up a result of multiplication computation of the parameters and inputs of each layer in the look-up table of said layer, and performing computation layer by layer until all computation is done (S340). The method performs non-uniform quantization on all parameters and inputs of a neural network, and further adopts a look-up table to replace multiplication computation, thus accelerating computation of the neural network.

机译：基于非均匀量化和查找表的神经网络计算加速方法和系统。方法（300）包括：对神经网络的每一层的参数执行非均匀量化（S310）;在神经网络的每一层的输入上执行非均匀量化（S320）;通过将所述层的参数的每个量化值乘以所述层的输入的每个量化值，来构造用于每个层的查找表（S330）;当要进行神经网络的正向计算时，在所述层的查找表中查找每个层的参数和输入的乘法计算结果，并逐层执行计算，直到完成所有计算为止（ S340）。该方法对神经网络的所有参数和输入执行非均匀量化，并进一步采用查找表代替乘法计算，从而加速了神经网络的计算。

著录项

公开/公告号WO2019080483A1

专利类型
公开/公告日2019-05-02

原文格式PDF
申请/专利权人 BEIJING DEEPHI INTELLIGENT TECHNOLOGY CO. LTD.;
展开▼

申请/专利号WO2018CN87117
发明设计人 JIANG FAN;WANG YU;SHENG XIAO;HAN SONG;SHAN YI;
展开▼

申请日2018-05-16
分类号G06N3/08;
国家 WO
入库时间 2022-08-21 11:55:03

相似文献

专利
外文文献
中文文献