SYSTEMS AND METHODS FOR PRUNING NEURAL NETWORKS FOR RESOURCE EFFICIENT INFERENCE

Abstract

A method, computer readable medium, and system are disclosed for neural network pruning. The method includes the steps of receiving first-order gradients of a cost function relative to layer parameters for a trained neural network and computing a pruning criterion for each layer parameter based on the first-order gradient corresponding to the layer parameter, where the pruning criterion indicates an importance of each neuron that is included in the trained neural network and is associated with the layer parameter. The method includes the additional steps of identifying at least one neuron having a lowest importance and removing the at least one neuron from the trained neural network to produce a pruned neural network.
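The steps in the abstract can be illustrated with a minimal sketch in plain Python. The abstract only states that the criterion is based on the first-order gradient; the specific score used here (the sum of |gradient × weight| over a neuron's parameters) is an assumption chosen for illustration, as are the function names `neuron_importance` and `prune_least_important` and the toy values.

```python
def neuron_importance(weights, grads):
    """Score each neuron (one row of weights) from first-order gradients.

    ASSUMPTION: the score is sum(|gradient * weight|) over the neuron's
    parameters. The abstract does not give the exact formula.
    """
    return [
        sum(abs(w * g) for w, g in zip(w_row, g_row))
        for w_row, g_row in zip(weights, grads)
    ]


def prune_least_important(weights, grads, num_to_remove=1):
    """Remove the lowest-scoring neurons and return (pruned_weights, kept_indices)."""
    scores = neuron_importance(weights, grads)
    # Rank neuron indices from least to most important.
    ranked = sorted(range(len(scores)), key=lambda i: scores[i])
    remove = set(ranked[:num_to_remove])
    keep = [i for i in range(len(weights)) if i not in remove]
    return [weights[i] for i in keep], keep


# Toy layer with 3 neurons and 2 parameters each (made-up values).
weights = [[1.0, -2.0], [0.25, 0.5], [3.0, 0.5]]
# Hypothetical gradients of the cost function with respect to those weights.
grads = [[0.5, 0.5], [0.5, -0.25], [-1.0, 2.0]]

pruned_weights, kept = prune_least_important(weights, grads)
```

In this toy layer the second neuron has the smallest score, so it is the one removed. In practice the gradients would come from backpropagation on the trained network rather than being written out by hand.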
