首页> 外文会议>IEEE International Conference on Robotics and Automation >Learning contact-rich manipulation skills with guided policy search
【24h】

Learning contact-rich manipulation skills with guided policy search

机译:通过指导性策略搜索学习接触丰富的操作技能

获取原文
获取外文期刊封面目录资料

摘要

Autonomous learning of object manipulation skills can enable robots to acquire rich behavioral repertoires that scale to the variety of objects found in the real world. However, current motion skill learning methods typically restrict the behavior to a compact, low-dimensional representation, limiting its expressiveness and generality. In this paper, we extend a recently developed policy search method [1] and use it to learn a range of dynamic manipulation behaviors with highly general policy representations, without using known models or example demonstrations. Our approach learns a set of trajectories for the desired motion skill by using iteratively refitted time-varying linear models, and then unifies these trajectories into a single control policy that can generalize to new situations. To enable this method to run on a real robot, we introduce several improvements that reduce the sample count and automate parameter selection. We show that our method can acquire fast, fluent behaviors after only minutes of interaction time, and can learn robust controllers for complex tasks, including putting together a toy airplane, stacking tight-fitting lego blocks, placing wooden rings onto tight-fitting pegs, inserting a shoe tree into a shoe, and screwing bottle caps onto bottles.
机译:自主学习对象操作技能可以使机器人获得丰富的行为方式,以适应现实世界中发现的各种对象。但是,当前的运动技能学习方法通​​常将行为限制为紧凑的低维表示,从而限制了其表现力和通用性。在本文中,我们扩展了最近开发的策略搜索方法[1],并使用它来学习具有高度通用策略表示的一系列动态操纵行为,而无需使用已知的模型或示例演示。我们的方法通过使用迭代重新拟合的时变线性模型来学习所需运动技能的一组轨迹,然后将这些轨迹统一为一个可以推广到新情况的单个控制策略。为了使这种方法能够在真实的机器人上运行,我们引入了一些改进措施,这些方法可以减少样本数量并自动执行参数选择。我们证明了我们的方法仅需互动几分钟即可获得快速,流畅的行为,并且可以学习用于复杂任务的强大控制器,包括组装玩具飞机,堆叠紧身的乐高积木,将木环放到紧身的钉子上,将a树插入鞋子中,然后将瓶盖拧到瓶子上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号