首页> 外国专利> EXPLORING AN UNEXPLORED DOMAIN BY PARALLEL REINFORCEMENT

EXPLORING AN UNEXPLORED DOMAIN BY PARALLEL REINFORCEMENT

机译:通过并行加强探索未开发的领域

摘要

Example embodiments describe a computer-implemented method for exploring, by a table-based parallel reinforcement learning, PRL, algorithm, an unexplored domain (100) comprising a plurality of agents (110-114) and states, the unexplored domain (100) represented by a state-action space (101, 102), the method comprising the following steps performed by one or more of the plurality of agents (110) receiving (510) an assigned partition (200) of the state-action space represented by a table; and executing (511) during a plurality of episodes actions for states within the partition (200), wherein an action transits a state; and granting (512) to a transited state a reward; and exchanging (513) state-action values with other agents of the plurality of agents (111-114) in the domain (100); and updating (514) the table.
机译:示例实施例描述了一种用于探索的计算机实现的方法,通过基于表的并行增强学习,PRL,算法,包括多个代理(110-114)和状态的未探测域(100),所示的未探究域(100)表示 通过状态动作空间(101,102),该方法包括由多个代理中的一个或多个(110)中的一个或多个接收(510)由A表示的状态动作空间的分配分区(200)执行的以下步骤 桌子; 在分区(200)内的状态的多个剧集动作期间执行(511),其中动作过期状态; 并给予过渡州的(512)奖励; 与域(100)中的多种试剂(111-114)的其他试剂交换(513)状态 - 动作值; 并更新(514)表。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号