NETWORK QUANTIZATION METHOD, INFERENCE METHOD, AND NETWORK QUANTIZATION DEVICE
展开▼
机译:网络量化方法,推理方法和网络量化设备
展开▼
页面导航
摘要
著录项
相似文献
摘要
This network quantization method for quantizing a neural network (14) includes: a database construction step (S20) for constructing a statistical information database (18) of tensors that are dealt with by the neural network (14) and obtained when a plurality of test data sets (12) are input to the neural network (14); a parameter generation step (S30) for generating a quantization parameter set by quantizing tensor values; and a network construction step (S40) for quantizing the neural network (14) by using the quantization parameter set (22), wherein, on the basis of the statistical information database (18), the parameter generation step (S30) sets, among the tensor values, a quantization step interval in a high frequency area including tensor values of the maximum frequency to be narrower than that in a low frequency area including tensor values having non-zero frequency and less frequency than the high frequency area.
展开▼