首页> 外国专利> EXPLORING AN UNEXPLORED DOMAIN BY PARALLEL REINFORCEMENT

EXPLORING AN UNEXPLORED DOMAIN BY PARALLEL REINFORCEMENT

机译：通过并行加强探索未开发的领域

页面导航

摘要
著录项
相似文献

摘要

Example embodiments describe a computer-implemented method for exploring, by a table-based parallel reinforcement learning, PRL, algorithm, an unexplored domain (100) comprising a plurality of agents (110-114) and states, the unexplored domain (100) represented by a state-action space (101, 102), the method comprising the following steps performed by one or more of the plurality of agents (110) receiving (510) an assigned partition (200) of the state-action space represented by a table; and executing (511) during a plurality of episodes actions for states within the partition (200), wherein an action transits a state; and granting (512) to a transited state a reward; and exchanging (513) state-action values with other agents of the plurality of agents (111-114) in the domain (100); and updating (514) the table.

机译：示例实施例描述了一种用于探索的计算机实现的方法，通过基于表的并行增强学习，PRL，算法，包括多个代理（110-114）和状态的未探测域（100），所示的未探究域（100）表示通过状态动作空间（101,102），该方法包括由多个代理中的一个或多个（110）中的一个或多个接收（510）由A表示的状态动作空间的分配分区（200）执行的以下步骤桌子; 在分区（200）内的状态的多个剧集动作期间执行（511），其中动作过期状态; 并给予过渡州的（512）奖励; 与域（100）中的多种试剂（111-114）的其他试剂交换（513）状态 - 动作值; 并更新（514）表。

著录项

公开/公告号EP3864510A1

专利类型
公开/公告日2021-08-18

原文格式PDF
申请/专利权人 IMEC VZW;UNIVERSITEIT ANTWERPEN;
展开▼

申请/专利号EP20190784060
发明设计人 CLAEYS MAXIM;CAMELO MIGUEL;LATRÉ STEVEN;
展开▼

申请日2019-10-11
分类号G06F9/46;
国家 EP
入库时间 2022-08-24 20:39:38

相似文献

专利
外文文献
中文文献