首页> 外文会议>International Conference on Development and Learning and Epigenetic Robotics >Behavioral Repertoire via Generative Adversarial Policy Networks
【24h】

Behavioral Repertoire via Generative Adversarial Policy Networks

机译:通过生成对抗策略网络的行为方式

获取原文

摘要

Learning algorithms are enabling robots to solve increasingly challenging real-world tasks. These approaches often rely on demonstrations and reproduce the behavior shown. Unexpected changes in the environment may require using different behaviors to achieve the same effect, for instance to reach and grasp an object in changing clutter. An emerging paradigm addressing this robustness issue is to learn a diverse set of successful behaviors for a given task, from which a robot can select the most suitable policy when faced with a new environment. In this paper, we explore a novel realization of this vision by learning a generative model over policies. Rather than learning a single policy, or a small fixed repertoire, our generative model for policies compactly encodes an unbounded number of policies and allows novel controller variants to be sampled. Leveraging our generative policy network, a robot can sample novel behaviors until it finds one that works for a new environment. We demonstrate this idea with an application of robust ball-throwing in the presence of obstacles. We show that this approach achieves a greater diversity of behaviors than an existing evolutionary approach, while maintaining good efficacy of sampled behaviors, allowing a Baxter robot to hit targets more often when ball throwing in the presence of obstacles.
机译:学习算法使机器人能够解决日益严峻的现实任务。这些方法通常依赖于演示并重现所显示的行为。环境中意料之外的变化可能需要使用不同的行为来达到相同的效果,例如在变化的杂波中达到并抓住物体。解决此鲁棒性问题的新兴范例是学习给定任务的多种成功行为,当机器人面对新环境时,可以从中选择最合适的策略。在本文中,我们通过学习政策生成模型来探索这种愿景的新颖实现。我们的策略生成模型无需学习单个策略或小的固定资源,而是紧凑地编码了无数策略,并允许对新型控制器变量进行采样。利用我们的生成策略网络,机器人可以对新颖的行为进行采样,直到找到适用于新环境的行为为止。我们通过在存在障碍物的情况下进行稳健的掷球来证明这一想法。我们证明,与现有的进化方法相比,此方法可实现更大的行为多样性,同时保持良好的采样行为效果,使巴克斯特机器人在有障碍物的情况下投球时可以更频繁地击中目标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号