Phase-dependent trajectory optimization for CPG-based biped walking using path integral reinforcement learning


Abstract

In this study, we introduce a phase-dependent trajectory optimization method for Central Pattern Generator (CPG)-based biped walking controllers. Many legged locomotion studies have shown that, by exploiting the synchronization property of the CPG controller, a CPG-based walking controller is robust against external perturbations and works well in real environments. However, because of the nonlinear dynamics of the coupled oscillator system composed of the CPG controller and the robot, analytically designing a biped trajectory that satisfies the requirements of a target walking pattern is rather difficult. It is therefore reasonable to use a nonlinear optimization method to improve the walking trajectory. A model-free optimal control method is preferable for this purpose because precise modeling of the ground contact is difficult. Model-free trajectory optimization methods have, however, been considered quite computationally demanding. Because of recent advances in nonlinear trajectory optimization, model-free optimization is now a realistic approach for biped trajectory optimization. We use a path integral reinforcement learning method to improve the biped walking trajectory for CPG-based walking controllers.
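
The idea of improving a phase-indexed trajectory with a path integral style update can be illustrated with a toy example. The sketch below is not the authors' implementation: it assumes a simplified, episode-level variant of path integral policy improvement (perturb the parameters, weight each rollout by the exponential of its negative normalized cost, average the perturbations), and it replaces the robot and ground contact with a stand-in tracking cost. The function names, the von Mises-shaped basis, the sinusoidal target, and all numeric settings are hypothetical choices made for illustration.

```python
import math
import random


def basis(phase, n_basis, width=2.0):
    """Normalized periodic (von Mises-shaped) basis activations at one phase."""
    centers = [2.0 * math.pi * i / n_basis for i in range(n_basis)]
    acts = [math.exp(width * (math.cos(phase - c) - 1.0)) for c in centers]
    total = sum(acts)
    return [a / total for a in acts]


def trajectory(theta, phases):
    """Desired joint angle at each phase: weighted sum of the phase bases."""
    return [sum(w * b for w, b in zip(theta, basis(p, len(theta)))) for p in phases]


def pi2_update(theta, cost_fn, rng, n_rollouts=20, sigma=0.3, h=10.0):
    """One episode-level, path-integral-style parameter update (assumed variant)."""
    # Explore: draw one Gaussian parameter perturbation per rollout.
    noises = [[rng.gauss(0.0, sigma) for _ in theta] for _ in range(n_rollouts)]
    costs = [cost_fn([t + e for t, e in zip(theta, eps)]) for eps in noises]
    # Map costs to weights: the lowest-cost rollout gets the largest weight.
    c_min, c_max = min(costs), max(costs)
    span = (c_max - c_min) or 1.0
    weights = [math.exp(-h * (c - c_min) / span) for c in costs]
    z = sum(weights)
    # Move the parameters by the weight-averaged perturbation.
    return [
        t + sum(w * eps[i] for w, eps in zip(weights, noises)) / z
        for i, t in enumerate(theta)
    ]


def demo(n_iters=100, seed=0):
    """Fit a toy phase-indexed target; returns (initial_cost, final_cost)."""
    rng = random.Random(seed)
    phases = [2.0 * math.pi * k / 50 for k in range(50)]  # one oscillator cycle
    target = [math.sin(p) for p in phases]  # hypothetical desired joint angle

    def cost_fn(theta):
        # Stand-in for a walking rollout: squared tracking error over the cycle.
        return sum((y - d) ** 2 for y, d in zip(trajectory(theta, phases), target))

    theta = [0.0] * 8
    initial = cost_fn(theta)
    for _ in range(n_iters):
        theta = pi2_update(theta, cost_fn, rng)
    return initial, cost_fn(theta)
```

Calling `initial, final = demo()` returns the tracking cost before and after the updates. No gradient or model of the cost is used, only rollout costs, which is the property that makes this family of methods attractive when contact dynamics are hard to model.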


Bibliographic details

Source: 2011 11th IEEE-RAS International Conference on Humanoid Robots | 2011 | pp. 255-260 | 6 pages
Authors: Sugimoto Norikazu; Morimoto Jun
Original format: PDF
Chinese Library Classification: Robotics

