AUTOMATIC MACHINE LEARNING POLICY NETWORK FOR PARAMETRIC BINARY NEURAL NETWORKS

Abstract

Systems, methods, apparatuses, and computer program products receive a plurality of binary weight values for a binary neural network, the values being sampled from a policy neural network comprising a posterior distribution conditioned on a theta value. An error of a forward propagation of the binary neural network may be determined based on training data and the received plurality of binary weight values. A respective gradient value may be computed for each of the plurality of binary weight values based on a backward propagation of the binary neural network. The theta value for the posterior distribution may be updated using reward values computed based on the gradient values, the plurality of binary weight values, and a scaling factor.
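The loop described in the abstract (sample binary weights, run forward propagation, backpropagate, turn gradients into rewards, update theta) can be illustrated with a toy sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the policy is a per-weight Bernoulli distribution with probability sigmoid(theta_i), the binary network is a single linear layer trained with squared error, the reward takes the form -scale * gradient * weight, and the update is a REINFORCE-style step. All function names are hypothetical.

```python
import itertools
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def sample_binary_weights(theta, rng):
    """Assumed policy: each weight is +1 with probability sigmoid(theta_i), else -1."""
    return [1 if rng.random() < sigmoid(t) else -1 for t in theta]


def forward_error_and_gradients(weights, data):
    """Forward pass of a one-layer linear model, then the mean squared-error
    gradient with respect to each binary weight (the backward pass)."""
    n = len(data)
    error = 0.0
    grads = [0.0] * len(weights)
    for x, y in data:
        residual = sum(w * xi for w, xi in zip(weights, x)) - y
        error += 0.5 * residual ** 2 / n
        for i, xi in enumerate(x):
            grads[i] += residual * xi / n
    return error, grads


def update_theta(theta, weights, grads, scale, lr):
    """REINFORCE-style step with an assumed reward: -scale * grad_i * w_i."""
    new_theta = []
    for t, w, g in zip(theta, weights, grads):
        reward = -scale * g * w
        # Derivative of log pi(w | theta) for the Bernoulli policy above.
        score = (1.0 if w == 1 else 0.0) - sigmoid(t)
        new_theta.append(t + lr * reward * score)
    return new_theta


def train(target, steps=500, scale=1.0, lr=0.5, seed=0):
    rng = random.Random(seed)
    # Toy training data: every +/-1 input, labelled by the target binary weights.
    inputs = list(itertools.product([-1, 1], repeat=len(target)))
    data = [(x, sum(w * xi for w, xi in zip(target, x))) for x in inputs]
    theta = [0.0] * len(target)
    for _ in range(steps):
        weights = sample_binary_weights(theta, rng)
        _, grads = forward_error_and_gradients(weights, data)
        theta = update_theta(theta, weights, grads, scale, lr)
    return theta
```

Calling `train([1, -1, 1, -1])` returns one theta per weight. Under these assumptions a weight that already matches the target receives a zero gradient and leaves its theta unchanged, while a mismatched weight receives a negative reward that shifts probability toward the opposite sign.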
