首页> 外国专利> EXPLORING AN UNEXPLORED DOMAIN BY PARALLEL REINFORCEMENT

EXPLORING AN UNEXPLORED DOMAIN BY PARALLEL REINFORCEMENT

机译:通过并行加固探索未占用的域

摘要

Example embodiments describe a computer-implemented method for exploring, by a table-based parallel reinforcement learning, PRL, algorithm, an unexplored domain (100) comprising a plurality of agents (110-114) and states, the unexplored domain (100) represented by a state-action space (101, 102), the method comprising the following steps performed by one or more of the plurality of agents (110) receiving (510) an assigned partition (200) of the state-action space represented by a table; and executing (511) during a plurality of episodes actions for states within the partition (200), wherein an action transits a state; and granting (512) to a transited state a reward; and exchanging (513) state-action values with other agents of the plurality of agents (111-114) in the domain (100); and updating (514) the table.
机译:示例实施例描述了一种计算机实现的方法,该方法用于通过基于表的并行增强学习,PRL算法探索包括多个代理(110-114)和状态的未探索域(100),所代表的未探索域(100)通过状态动作空间(101、102),该方法包括以下步骤:由多个代理(110)中的一个或多个接收(510)由状态代理空间表示的状态动作空间的分配分区(200)来执行表;在分区的多个情节中,对分区(200)内的状态执行动作(511),其中动作转变状态;给予(512)过境国奖励;与域(100)中的多个代理(111-114)中的其他代理交换(513)状态作用值;并更新(514)表。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号