首页>
外国专利>
NEURAL NETWORK QUANTIZATION METHOD USING MULTIPLE REFINED QUANTIZED KERNELS FOR CONSTRAINED HARDWARE DEPLOYMENT
NEURAL NETWORK QUANTIZATION METHOD USING MULTIPLE REFINED QUANTIZED KERNELS FOR CONSTRAINED HARDWARE DEPLOYMENT
展开▼
机译:约束硬件部署的多重精细化核神经网络量化方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
A method of configuring a neural network, trained from a plurality of data samples, comprising: quantizing each layer of the neural network to produce a quantized neural network according to a plurality of respective scaling factors; locating one or more layers of the quantized neural network; computing a modified quantization for the one or more located layers to produce a modified quantized neural network; and adjusting the plurality of scaling factors of the modified quantized neural network by computing a similarity between a plurality of neural network outputs and a plurality of modified quantized neural network outputs.
展开▼