首页>
外国专利>
METHODS AND SYSTEMS FOR CONVERTING WEIGHTS OF A DEEP NEURAL NETWORK FROM A FIRST NUMBER FORMAT TO A SECOND NUMBER FORMAT
METHODS AND SYSTEMS FOR CONVERTING WEIGHTS OF A DEEP NEURAL NETWORK FROM A FIRST NUMBER FORMAT TO A SECOND NUMBER FORMAT
展开▼
机译:将深神经网络的权重从第一号格式转换为第二号格式的方法和系统
展开▼
页面导航
摘要
著录项
相似文献
摘要
Methods and system for converting a plurality of weights of a filter of a Deep Neural Network (DNN) in a first number format to a second number format, the second number format having less precision than the first number format, to enable the DNN to be implemented in hardware logic. The method comprising: determining, for each of the plurality of weights, a quantisation error associated with quantising that weight to the second number format in accordance with a first quantisation method; determining a total quantisation error for the plurality of weights based on the quantisation errors for the plurality of weights; identifying a subset of the plurality of weights to be quantised to the second number format in accordance with a second quantisation method based on the total quantisation error for the plurality of weights; and generating a set of quantised weights representing the plurality of weights in the second number format, the quantised weight for each weight in the subset of the plurality of weights based on quantising that weight to the second number format in accordance with the second quantisation method and the quantised weight for each of the remaining weights of the plurality of weights based on quantising that weight to the second number format in accordance with the first quantisation method.
展开▼