首页> 外文会议>European Symposium on Artificial Neural Networks >Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning
【24h】

Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning

机译:将地区自然演员 - 评论家批评建筑应用于电机原始学习

获取原文

摘要

In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists out of actor updates which are achieved using natural stochastic policy gradients while the critic obtains the natural policy gradient by linear regression. We show that this architecture can be used to learn the "building blocks of movement generation", called motor primitives. Motor primitives are parameterized control policies such as splines or nonlinear differential equations with desired attractor properties. We show that our most modern algorithm, the Episodic Natural Actor-Critic outperforms previous algorithms by at least an order of magnitude. We demonstrate the efficiency of this reinforcement learning method in the application of learning to hit a baseball with an anthropomorphic robot arm.
机译:在本文中,我们调查了与自然演员 - 评论家方法的运动原始学习。自然演员 - 评论家包括使用自然随机政策梯度实现的演员更新,而评论家通过线性回归获得自然政策梯度。我们表明,这种架构可用于学习称为电机基元的“机动生成块”。电机基元是参数化控制策略,如具有所需吸引子属性的花键或非线性微分方程。我们表明,我们最现代的算法,焦点自然演员 - 评论家以至少一种数量级占上了先前的算法。我们展示了这种加强学习方法在学习击打棒球的应用中的效率,用拟人机器人臂击打棒球。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号