首页>
外国专利>
DEEP CONVOLUTIONAL NEURAL NETWORK ACCELERATION AND COMPRESSION METHOD BASED ON PARAMETER QUANTIFICATION
DEEP CONVOLUTIONAL NEURAL NETWORK ACCELERATION AND COMPRESSION METHOD BASED ON PARAMETER QUANTIFICATION
展开▼
机译:基于参数量化的深卷积神经网络加速与压缩方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
The present invention provides a deep convolutional neural network acceleration and compression method based on parameter quantification, comprising: performing quantification on parameters of a deep convolutional neural network, to obtain multiple sub-codebooks and index values separately corresponding to the multiple sub-codebooks; and obtaining a feature graph of output of the deep convolutional neural network according to the multiple sub-codebooks and the index values separately corresponding to the multiple sub-codebooks. By means of the present invention, the acceleration and compression of a deep convolutional neural network can be implemented.
展开▼