首页> 外文会议>IEEE-RAS International Conference on Humanoid Robots >Local Update Dynamic Policy Programming in Reinforcement Learning of Pneumatic Artificial Muscle-Driven Humanoid Hand Control
【24h】

Local Update Dynamic Policy Programming in Reinforcement Learning of Pneumatic Artificial Muscle-Driven Humanoid Hand Control

机译:局部更新动态策略规划,加固学习气动人工肌肉驱动的人形手动控制

获取原文

摘要

Pneumatic Artificial Muscle (PAM) is an attractive device to be used as an actuator for humanoid robots because of its high power-to-weight ratio and good flexibility. However, both the modeling and the controlling of PAM-driven robots are challenging due to the high nonlinearities of a PAM's air pressure dynamics and its mechanical structure. This paper focuses on applying Reinforcement Learning (RL) to the control of a PAM-driven robots without our knowledge of its model. We propose a new RL algorithm, Local Update Dynamic Policy Programming (LUDPP), as an extension of Dynamic Policy Programming (DPP). This algorithm exploits the nature of smooth policy update of DPP to considerably reduce the computational complexity in both time and space: at each iteration, this algorithm only updates the value function locally throughout the whole state-action space. We applied LUDPP to control one finger (2 DOFs with a 12-dimensional state-action space) of Shadow Dexterous Hand, a PAM-driven humanoid robot hand. Experimental results suggest that our method can achieve successful control of such a robot with a limited computational resource whereas other conventional value function based RL algorithms (DPP, LSPI) cannot.
机译:气动人造肌肉(PAM)是一种有吸引力的装置,用于用作人形机器人的致动器,因为其高功率重量比和良好的灵活性。然而,由于PAM空气压力动力学及其机械结构的高度非线性,所建模和控制普及的机器人的控制是具有挑战性的。本文侧重于将加强学习(RL)应用于控制驱动的机器人,而无需了解其模型。我们提出了一种新的RL算法,本地更新动态策略编程(LUDPP),作为动态策略编程(DPP)的扩展。该算法利用DPP的平滑策略更新的性质在各个时间和空间中显着降低计算复杂性:在每次迭代时,该算法仅更新整个状态动作空间的本地值。我们应用了Ludpp来控制一个手指(2 DOF,带有12维的状态动作空间)的阴影迷惑手,一个PAM驱动的人形机器人手。实验结果表明,我们的方法可以实现具有有限的计算资源的这种机器人的成功控制,而基于其他传统的价值函数的RL算法(DPP,LSPI)不能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号