首页> 外文会议>International Conference on Robotics and Automation >Learning Robust Manipulation Skills with Guided Policy Search via Generative Motor Reflexes
【24h】

Learning Robust Manipulation Skills with Guided Policy Search via Generative Motor Reflexes

机译:通过生成性运动反射通过指导性策略搜索来学习强大的操纵技能

获取原文

摘要

Guided Policy Search enables robots to learn control policies for complex manipulation tasks efficiently. Therein, the control policies are represented as high-dimensional neural networks which derive robot actions based on states. However, due to the small number of real-world trajectory samples in Guided Policy Search, the resulting neural networks are only robust in the neighbourhood of the trajectory distribution explored by real-world interactions. In this paper, we present a new policy representation called Generative Motor Reflexes, which is able to generate robust actions over a broader state space compared to previous methods. In contrast to prior state-action policies, Generative Motor Reflexes map states to parameters for a state-dependent motor reflex, which is then used to derive actions. Robustness is achieved by generating similar motor reflexes for many states. We evaluate the presented method in simulated and real-world manipulation tasks, including contact-rich peg-in-hole tasks. Using these evaluation tasks, we show that policies represented as Generative Motor Reflexes lead to robust manipulation skills also outside the explored trajectory distribution with less training needs compared to previous methods.
机译:引导策略搜索使机器人能够有效地学习用于复杂操作任务的控制策略。其中,控制策略以高维神经网络表示,该神经网络基于状态得出机器人动作。但是,由于在“指导策略搜索”中的真实世界轨迹样本数量很少,因此所得的神经网络仅在真实世界交互作用探索的轨迹分布附近具有鲁棒性。在本文中,我们提出了一种新的策略表示形式,称为Generative Motor Reflexes,与以前的方法相比,它能够在更广泛的状态空间上生成可靠的动作。与先前的状态动作策略相比,“生成运动反射”将状态映射到用于状态相关的运动反射的参数,然后将其用于导出动作。通过在许多状态下产生相似的运动反射来实现鲁棒性。我们在模拟和实际操作任务中评估了本文提出的方法,包括接触丰富的孔中钉任务。使用这些评估任务,我们表明,以生成运动反射为代表的策略还可以在探索的轨迹分布之外产生强大的操纵技能,与以前的方法相比,培训需求更少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号