首页> 外国专利> COOPERATIVE NEURAL NETWORK DEEP REINFORCEMENT LEARNING WITH PARTIAL INPUT ASSISTANCE

COOPERATIVE NEURAL NETWORK DEEP REINFORCEMENT LEARNING WITH PARTIAL INPUT ASSISTANCE

机译:带有部分输入协助的合作神经网络深度强化学习

摘要

Deep reinforcement learning of cooperative neural networks can be performed by obtaining an action and observation sequence including a plurality of time frames, each time frame including action values and observation values. At least some of the observation values of each time frame of the action and observation sequence can be input sequentially into a first neural network including a plurality of first parameters. The action values of each time frame of the action and observation sequence and output values from the first neural network corresponding to the at least some of the observation values of each time frame of the action and observation sequence can be input sequentially into a second neural network including a plurality of second parameters. An action-value function can be approximated using the second neural network, and the plurality of first parameters of the first neural network can be updated using backpropagation.
机译:协作神经网络的深度强化学习可以通过获取包含多个时间范围的动作和观察序列来执行,每个时间帧都包含动作值和观察值。可以将动作和观察序列的每个时间帧的观察值中的至少一些顺序输入到包括多个第一参数的第一神经网络中。可以将动作和观察序列的每个时间框架的动作值以及对应于动作和观察序列的每个时间框架的至少一些观察值的来自第一神经网络的输出值依次输入到第二神经网络包括多个第二参数。可以使用第二神经网络来近似动作值函数,并且可以使用反向传播来更新第一神经网络的多个第一参数。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号