首页> 外国专利> EXPLORING AN UNEXPLORED DOMAIN BY PARALLEL REINFORCEMENT

EXPLORING AN UNEXPLORED DOMAIN BY PARALLEL REINFORCEMENT

机译：通过并行加固探索未占用的域

页面导航

摘要
著录项
相似文献

摘要

Example embodiments describe a computer-implemented method for exploring, by a table-based parallel reinforcement learning, PRL, algorithm, an unexplored domain (100) comprising a plurality of agents (110-114) and states, the unexplored domain (100) represented by a state-action space (101, 102), the method comprising the following steps performed by one or more of the plurality of agents (110) receiving (510) an assigned partition (200) of the state-action space represented by a table; and executing (511) during a plurality of episodes actions for states within the partition (200), wherein an action transits a state; and granting (512) to a transited state a reward; and exchanging (513) state-action values with other agents of the plurality of agents (111-114) in the domain (100); and updating (514) the table.

机译：示例实施例描述了一种计算机实现的方法，该方法用于通过基于表的并行增强学习，PRL算法探索包括多个代理（110-114）和状态的未探索域（100），所代表的未探索域（100）通过状态动作空间（101、102），该方法包括以下步骤：由多个代理（110）中的一个或多个接收（510）由状态代理空间表示的状态动作空间的分配分区（200）来执行表;在分区的多个情节中，对分区（200）内的状态执行动作（511），其中动作转变状态;给予（512）过境国奖励;与域（100）中的多个代理（111-114）中的其他代理交换（513）状态作用值;并更新（514）表。

著录项

公开/公告号EP3637256A1

专利类型
公开/公告日2020-04-15

原文格式PDF
申请/专利权人 IMEC VZW;UNIVERSITEIT ANTWERPEN;
展开▼

申请/专利号EP20180200069
发明设计人 CLAEYS MAXIM;CAMELO MIGUEL;LATRÉ STEVEN;
展开▼

申请日2018-10-12
分类号G06F9/46;
国家 EP
入库时间 2022-08-21 11:39:51

相似文献

专利
外文文献
中文文献