首页>
外国专利>
METHOD FOR COMPRESSING NEURAL NETWORK MODEL, DEVICE, AND COMPUTER APPARATUS
METHOD FOR COMPRESSING NEURAL NETWORK MODEL, DEVICE, AND COMPUTER APPARATUS
展开▼
机译:神经网络模型,装置和计算机设备的压缩方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
A method for compressing a neural network model, a device, a computer apparatus, and a computer readable medium. The method comprises: acquiring a first trained neural network model (S202); selecting one or more layers from layers of the first neural network model as layers to be compressed (S204); sorting the layers to be compressed according to a pre-determined rule (S206); and compressing, according to a sequential order from the sorting and by means of a genetic algorithm, a portion or all of the layers to be compressed, and obtaining a second neural network model (S208), wherein the accuracy of the second neural network model based on a pre-configured training sample is not less than a pre-determined accuracy value. The method, the device, the computer apparatus, and the computer readable medium compress a trained neural network model by means of a genetic algorithm, thereby reducing a computational load and storage space of the neural network model, and providing applicability of the same to apparatuses having limited memory and computational resources without compromising accuracy or compression of the neural network model.
展开▼