IFAC PapersOnLine

Reinforcement Learning of Potential Fields to achieve Limit-Cycle Walking



Abstract

Reinforcement learning is a powerful tool for deriving controllers for systems where no models are available. Policy search algorithms in particular are suitable for complex systems, keeping learning time manageable and accommodating continuous state and action spaces. However, these algorithms demand more insight into the system in order to choose a suitable controller parameterization. This paper investigates a type of policy parameterization for impedance control that allows energy input to be implicitly bounded: potential fields. In this work, a methodology is presented for generating a potential field-constrained impedance controller via approximation of example trajectories, and subsequently improving the control policy using reinforcement learning. The potential field-constrained approximation is used as a policy parameterization for policy search reinforcement learning and is compared to its unconstrained counterpart. Simulations on a simple biped walking model show that the learned controllers are able to overcome the potential field of gravity, generating a stable limit-cycle gait on flat ground for both parameterizations. The potential field-constrained controller provides safety through a known energy bound while performing as well as the unconstrained policy.
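The core idea of the abstract, an impedance controller whose stiffness torque is the gradient of a learned potential field, so that the energy the controller can inject is bounded by the range of the field, can be sketched as follows. This is a minimal illustration assuming a radial-basis-function parameterization of the potential; the paper's actual basis functions and learning algorithm are not specified here, and all names are hypothetical.

```python
import numpy as np

class PotentialFieldController:
    """Sketch of a potential field-constrained impedance controller:
    tau = -dU/dq - d * q_dot, where U(q) is a learned potential.
    Because the stiffness torque is conservative and damping only
    dissipates, the controller can add at most max(U) - min(U) of
    mechanical energy along any trajectory."""

    def __init__(self, centers, widths, weights, damping):
        self.c = np.asarray(centers, dtype=float)  # RBF centers over joint angle q
        self.s = np.asarray(widths, dtype=float)   # RBF widths
        self.w = np.asarray(weights, dtype=float)  # learnable policy parameters
        self.d = float(damping)                    # viscous damping gain

    def potential(self, q):
        # U(q) = sum_i w_i * exp(-(q - c_i)^2 / (2 s_i^2))
        return float(np.sum(self.w * np.exp(-(q - self.c) ** 2 / (2 * self.s ** 2))))

    def torque(self, q, qdot):
        # Conservative term -dU/dq plus dissipative damping term.
        dU = np.sum(self.w * (-(q - self.c) / self.s ** 2)
                    * np.exp(-(q - self.c) ** 2 / (2 * self.s ** 2)))
        return float(-dU - self.d * qdot)

    def energy_bound(self, q_grid):
        # Implicit energy bound: the most energy the policy can inject,
        # evaluated numerically over a grid of joint angles.
        u = np.array([self.potential(q) for q in q_grid])
        return float(u.max() - u.min())
```

In a policy search setting, the weights `w` would be the parameters being optimized; the energy bound holds for any weight values, which is what distinguishes this parameterization from an unconstrained torque policy.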
