Learning Robust Manipulation Skills with Guided Policy Search via Generative Motor Reflexes

机译：通过生成性运动反射通过指导性策略搜索来学习强大的操纵技能

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Guided Policy Search enables robots to learn control policies for complex manipulation tasks efficiently. Therein, the control policies are represented as high-dimensional neural networks which derive robot actions based on states. However, due to the small number of real-world trajectory samples in Guided Policy Search, the resulting neural networks are only robust in the neighbourhood of the trajectory distribution explored by real-world interactions. In this paper, we present a new policy representation called Generative Motor Reflexes, which is able to generate robust actions over a broader state space compared to previous methods. In contrast to prior state-action policies, Generative Motor Reflexes map states to parameters for a state-dependent motor reflex, which is then used to derive actions. Robustness is achieved by generating similar motor reflexes for many states. We evaluate the presented method in simulated and real-world manipulation tasks, including contact-rich peg-in-hole tasks. Using these evaluation tasks, we show that policies represented as Generative Motor Reflexes lead to robust manipulation skills also outside the explored trajectory distribution with less training needs compared to previous methods.

机译：引导策略搜索使机器人能够有效地学习用于复杂操作任务的控制策略。其中，控制策略以高维神经网络表示，该神经网络基于状态得出机器人动作。但是，由于在“指导策略搜索”中的真实世界轨迹样本数量很少，因此所得的神经网络仅在真实世界交互作用探索的轨迹分布附近具有鲁棒性。在本文中，我们提出了一种新的策略表示形式，称为Generative Motor Reflexes，与以前的方法相比，它能够在更广泛的状态空间上生成可靠的动作。与先前的状态动作策略相比，“生成运动反射”将状态映射到用于状态相关的运动反射的参数，然后将其用于导出动作。通过在许多状态下产生相似的运动反射来实现鲁棒性。我们在模拟和实际操作任务中评估了本文提出的方法，包括接触丰富的孔中钉任务。使用这些评估任务，我们表明，以生成运动反射为代表的策略还可以在探索的轨迹分布之外产生强大的操纵技能，与以前的方法相比，培训需求更少。

著录项

来源
《International Conference on Robotics and Automation》|2019年|7851-7857|共7页
会议地点 Montreal(CA)
作者
Philipp Ennen; Pia Bresenitz; Rene Vossen; Frank Hees;
展开▼
作者单位

Cybernetics Lab IMA & IfU at RWTH Aachen University Germany;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Neural networks; Trajectory; Robots; Robustness; Reinforcement learning; Space exploration; Training;

机译：神经网络;弹道;机器人；坚固性强化学习；太空探索;训练;
入库时间 2022-08-26 14:42:13

相似文献

外文文献
中文文献
专利

1. Reinforcement learning of motor skills using Policy Search and human corrective advice [J] . The International journal of robotics research . 2019,第14期

机译：使用策略搜索和人工纠正建议加强运动技能的学习
2. Cognitive and Motor Learning in Internally-Guided Motor Skills [J] . Krishn Bera, Anuj Shukla, Raju S. Bapi Frontiers in Psychology . 2021,第a期

机译：内部导向机动技能的认知和电机学习
3. The Influence of Guided Error-Based Learning on Motor Skills Self-Efficacy and Achievement [J] . Chien Kuei-Pin, Chen Sufen Journal of motor behavior . 2018,第1a6期

机译：基于误区的影响对机动技能自我效能和成就的影响
4. Learning Robust Manipulation Skills with Guided Policy Search via Generative Motor Reflexes [C] . Philipp Ennen, Pia Bresenitz, Rene Vossen, International Conference on Robotics and Automation . 2019

机译：通过生成电机反射学习具有导向政策的强大操纵技能
5. Self-Efficacy as a Generative Mechanism for Future Self-Guides and Feedback-Seeking Behavior in Language Learning [D] . Bondarenko, Anna Vitalyevna. 2020

机译：自我效能作为未来自我指导和语言学习中的反馈行为的生成机制
6. Learning a Set of Interrelated Tasks by Using a Succession of Motor Policies for a Socially Guided Intrinsically Motivated Learner [O] . Nicolas Duminy, Sao Mai Nguyen, Dominique Duhaut 2018

机译：通过使用一系列运动策略为社会指导的内在动机学习者学习一系列相互关联的任务
7. Learning Contact-Rich Manipulation Skills with Guided Policy Search [O] . Levine, Sergey, Wagener, Nolan, Abbeel, Pieter 2015

机译：通过引导式策略搜索学习联系人丰富的操作技巧

Learning Robust Manipulation Skills with Guided Policy Search via Generative Motor Reflexes

摘要

著录项

相似文献

相关主题

期刊订阅