首页> 外文会议>International Conference on Development and Learning and Epigenetic Robotics >Behavioral Repertoire via Generative Adversarial Policy Networks
【24h】

Behavioral Repertoire via Generative Adversarial Policy Networks

机译:通过生成的对抗政策网络行为曲目

获取原文

摘要

Learning algorithms are enabling robots to solve increasingly challenging real-world tasks. These approaches often rely on demonstrations and reproduce the behavior shown. Unexpected changes in the environment may require using different behaviors to achieve the same effect, for instance to reach and grasp an object in changing clutter. An emerging paradigm addressing this robustness issue is to learn a diverse set of successful behaviors for a given task, from which a robot can select the most suitable policy when faced with a new environment. In this paper, we explore a novel realization of this vision by learning a generative model over policies. Rather than learning a single policy, or a small fixed repertoire, our generative model for policies compactly encodes an unbounded number of policies and allows novel controller variants to be sampled. Leveraging our generative policy network, a robot can sample novel behaviors until it finds one that works for a new environment. We demonstrate this idea with an application of robust ball-throwing in the presence of obstacles. We show that this approach achieves a greater diversity of behaviors than an existing evolutionary approach, while maintaining good efficacy of sampled behaviors, allowing a Baxter robot to hit targets more often when ball throwing in the presence of obstacles.
机译:学习算法使机器人能够解决越来越具有挑战性的真实世界任务。这些方法通常依赖于演示并重现所显示的行为。环境中的意外变化可能需要使用不同的行为来实现相同的效果,例如达到和掌握在更改杂波中的对象。解决这种稳健性问题的新兴范式是为特定任务学习一个不同的成功行为,机器人可以在面对新环境时选择最合适的策略。在本文中,我们通过在政策中学习生成模型来探索这一愿景的新颖实现。我们的生成模型而不是学习单一策略,而不是学习单一策略,而是用于策略的生成模型,紧凑地编码无限的策略,并允许采样新的控制器变体。利用我们的生成政策网络,机器人可以在找到一个适用于新环境的行为之前来样的新行为。我们在存在障碍物的情况下展示了这种想法。我们表明,这种方法比现有的进化方法实现了更大的行为,同时保持采样行为的良好功效,允许百乐者机器人在球在存在障碍物的存在时更频繁地击中目标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号