Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning


Abstract

In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists of actor updates, which are achieved using natural stochastic policy gradients, while the critic obtains the natural policy gradient by linear regression. We show that this architecture can be used to learn the "building blocks of movement generation", called motor primitives. Motor primitives are parameterized control policies such as splines or nonlinear differential equations with desired attractor properties. We show that our most recent algorithm, the Episodic Natural Actor-Critic, outperforms previous algorithms by at least an order of magnitude. We demonstrate the efficiency of this reinforcement learning method by applying it to the task of learning to hit a baseball with an anthropomorphic robot arm.
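The abstract's description of the critic (natural policy gradient obtained by linear regression) can be illustrated with a minimal sketch. It assumes the commonly cited episodic formulation, in which each episode's return is regressed on its summed log-policy gradients plus a constant baseline term. The function names (`solve_linear_system`, `enac_gradient`, `run_toy`), the one-step Gaussian toy task, and all numeric settings are hypothetical choices for illustration and are not taken from the paper.

```python
import random


def solve_linear_system(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                factor = M[r][col] / M[col][col]
                for c in range(col, n + 1):
                    M[r][c] -= factor * M[col][c]
    return [M[i][n] / M[i][i] for i in range(n)]


def enac_gradient(episodes, gamma=1.0):
    """Estimate a natural gradient by least-squares regression over episodes.

    episodes: list of episodes; each episode is a list of
              (grad_log_pi, reward) pairs, grad_log_pi being a list of floats.
    Returns (w, baseline): w is the regression weight vector used as the
    natural-gradient estimate, baseline is the fitted constant term.
    """
    rows, returns = [], []
    for episode in episodes:
        dim = len(episode[0][0])
        phi = [0.0] * dim   # discounted sum of log-policy gradients
        ret = 0.0           # discounted return of the episode
        for t, (grad, reward) in enumerate(episode):
            disc = gamma ** t
            for k in range(dim):
                phi[k] += disc * grad[k]
            ret += disc * reward
        rows.append(phi + [1.0])  # trailing 1.0 fits the constant baseline
        returns.append(ret)
    n = len(rows[0])
    # Normal equations: (Phi^T Phi) x = Phi^T R
    AtA = [[sum(r[i] * r[j] for r in rows) for j in range(n)] for i in range(n)]
    Atb = [sum(r[i] * R for r, R in zip(rows, returns)) for i in range(n)]
    sol = solve_linear_system(AtA, Atb)
    return sol[:-1], sol[-1]


def run_toy(seed=0, iterations=30, episodes_per_update=50):
    """Hypothetical toy task: one-step episodes, Gaussian policy, quadratic cost."""
    rng = random.Random(seed)
    theta, sigma, target, alpha = 0.0, 0.5, 2.0, 0.5
    for _ in range(iterations):
        batch = []
        for _ in range(episodes_per_update):
            action = rng.gauss(theta, sigma)
            grad = [(action - theta) / sigma ** 2]  # d/dtheta log N(action; theta, sigma^2)
            reward = -(action - target) ** 2
            batch.append([(grad, reward)])
        w, _ = enac_gradient(batch)
        theta += alpha * w[0]  # actor update along the estimated natural gradient
    return theta
```

Calling `run_toy()` with the defaults should move the policy mean from 0.0 toward the target of 2.0; the exact value depends on the seed. The regression weights stand in for the natural gradient here, so no Fisher matrix is formed or inverted explicitly.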


Bibliographic details

Source: European Symposium on Artificial Neural Networks, 2007, 6 pages
Authors: Jan Peters; Stefan Schaal
Original format: PDF
Chinese Library Classification: General issues

Similar literature

1. An algorithm of pretrained fuzzy actor-critic learning applying in fixed-time space differential game [J]. Wang Xiao, Shi Peng, Schwartz Howard. Proceedings of the Institution of Mechanical Engineers. 2021, No. 14
2. Adaptive actor-critic learning for the control of mobile robots by applying predictive models [J]. Syam R, Watanabe K, Izumi K. Soft Computing: A Fusion of Foundations, Methodologies and Applications. 2005, No. 11
3. A priori-knowledge/actor-critic reinforcement learning architecture for computing the mean-variance customer portfolio: The case of bank marketing campaigns [J]. Emma M. Sanchez, Julio B. Clempner, Alexander S. Poznyak. Engineering Applications of Artificial Intelligence. 2015, No. NOVaPTaA
4. Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning [C]. Jan Peters, Stefan Schaal. European Symposium on Artificial Neural Networks. 2007
5. A New Class of Neural Architectures to Model Episodic Memory: Computational Studies of Distal Reward Learning [D]. Taylor, Shawn E. 2012
6. A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives [O]. Cai Li, Robert Lowe, Tom Ziemke. 2014
7. A Novel Approach to Locomotion Learning: Actor-Critic Architecture using Central Pattern Generators and Dynamic Motor Primitives [O]. Cai Li, Robert Lowe, Tom Ziemke. 2014
8. Research in Architectural Approaches to the Integration of Empirical, Analytic and Episodic Learning within SOAR [R]. Laird, J. E. 2005
