首页> 外国专利> COOPERATIVE NEURAL NETWORK DEEP REINFORCEMENT LEARNING WITH PARTIAL INPUT ASSISTANCE

COOPERATIVE NEURAL NETWORK DEEP REINFORCEMENT LEARNING WITH PARTIAL INPUT ASSISTANCE

机译：带有部分输入协助的合作神经网络深度强化学习

页面导航

摘要
著录项
相似文献

摘要

Deep reinforcement learning of cooperative neural networks can be performed by obtaining an action and observation sequence including a plurality of time frames, each time frame including action values and observation values. At least some of the observation values of each time frame of the action and observation sequence can be input sequentially into a first neural network including a plurality of first parameters. The action values of each time frame of the action and observation sequence and output values from the first neural network corresponding to the at least some of the observation values of each time frame of the action and observation sequence can be input sequentially into a second neural network including a plurality of second parameters. An action-value function can be approximated using the second neural network, and the plurality of first parameters of the first neural network can be updated using backpropagation.

机译：协作神经网络的深度强化学习可以通过获取包含多个时间范围的动作和观察序列来执行，每个时间帧都包含动作值和观察值。可以将动作和观察序列的每个时间帧的观察值中的至少一些顺序输入到包括多个第一参数的第一神经网络中。可以将动作和观察序列的每个时间框架的动作值以及对应于动作和观察序列的每个时间框架的至少一些观察值的来自第一神经网络的输出值依次输入到第二神经网络包括多个第二参数。可以使用第二神经网络来近似动作值函数，并且可以使用反向传播来更新第一神经网络的多个第一参数。

著录项

公开/公告号US2019042943A1

专利类型
公开/公告日2019-02-07

原文格式PDF
申请/专利权人 INTERNATIONAL BUSINESS MACHINES CORPORATION;
展开▼

申请/专利号US201715669146
发明设计人 SAKYASINGHA DASGUPTA;TAKAYUKI OSOGAMI;
展开▼

申请日2017-08-04
分类号G06N3/08;G06N3/04;G06N7;
国家 US
入库时间 2022-08-21 12:04:08

相似文献

专利
外文文献
中文文献