首页> 外文会议>International Conference on Machine Learning >CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning
【24h】

CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning

机译:好奇:内在动机的模块化多目标强化学习

获取原文

摘要

In open-ended environments, autonomous learning agents must set their own goals and build their own curriculum through an intrinsically motivated exploration. They may consider a large diversity of goals, aiming to discover what is controllable in their environments, and what is not. Because some goals might prove easy and some impossible, agents must actively select which goal to practice at any moment, to maximize their overall mastery on the set of learnable goals. This paper proposes CURIOUS, an algorithm that leverages 1) a modular Universal Value Function Approximator with hindsight learning to achieve a diversity of goals of different kinds within a unique policy and 2) an automated curriculum learning mechanism that biases the attention of the agent towards goals maximizing the absolute learning progress. Agents focus sequentially on goals of increasing complexity, and focus back on goals that are being forgotten. Experiments conducted in a new modular-goal robotic environment show the resulting developmental self-organization of a learning curriculum, and demonstrate properties of robustness to distracting goals, forgetting and changes in body properties.
机译:在开放式环境中,自主学习代理商必须通过本质上积极的探索来设定自己的目标并建立自己的课程。他们可能会考虑大量的目标,旨在发现在其环境中可控制的目标,而不是。因为某些目标可能会简单而且一些不可能的人,代理商必须在任何时候都会积极选择练习的目标,以最大限度地掌握对可读目标的整体掌握。本文提出了一种奇和的算法,它利用了1)模块化通用价值函数近似器与后可以的学习,实现了不同类型的不同类型的多样性,以及2)一种自动课程学习机制,使代理对目标的注意力偏向目标的自动化课程学习机制最大化绝对学习进度。代理商在越来越复杂的目标上依次焦点,并重点归咎于被遗忘的目标。在新的模块化机器人环境中进行的实验表明了由此产生的学习课程的发育自组织,并展示了稳健性的性质,以分散目标,遗忘和身体性质的变化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号