IEEE Transactions on Neural Systems and Rehabilitation Engineering

Training an Actor-Critic Reinforcement Learning Controller for Arm Movement Using Human-Generated Rewards



Abstract

Functional Electrical Stimulation (FES) employs neuroprostheses to apply electrical current to the nerves and muscles of individuals paralyzed by spinal cord injury to restore voluntary movement. Neuroprosthesis controllers calculate stimulation patterns to produce desired actions. To date, no existing controller is able to efficiently adapt its control strategy to the wide range of possible physiological arm characteristics, reaching movements, and user preferences that vary over time. Reinforcement learning (RL) is a control strategy that can incorporate human reward signals as inputs to allow human users to shape controller behavior. In this paper, ten neurologically intact human participants assigned subjective numerical rewards to train RL controllers, evaluating animations of goal-oriented reaching tasks performed using a planar musculoskeletal human arm simulation. The RL controller learning achieved using human trainers was compared with learning accomplished using human-like rewards generated by an algorithm; metrics included success at reaching the specified target; time required to reach the target; and target overshoot. Both sets of controllers learned efficiently and with minimal differences, significantly outperforming standard controllers. Reward positivity and consistency were found to be unrelated to learning success. These results suggest that human rewards can be used effectively to train RL-based FES controllers.
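The paper's controller and musculoskeletal simulation are not reproduced here. As a minimal illustrative sketch only, the snippet below shows one standard way an actor-critic learner can be driven by a scalar reward signal, with a hand-coded human-like reward function standing in for the human trainer (mirroring the algorithmic comparison the abstract describes). The one-dimensional "arm" dynamics, the radial-basis feature encoding, and all constants are hypothetical, not taken from the paper.

    import numpy as np

    # Illustrative actor-critic with a scalar (human-style) reward signal.
    # State: a 1-D hand-position error; action: a scalar stimulation command.
    rng = np.random.default_rng(0)

    N_FEATURES = 8
    centers = np.linspace(-1.0, 1.0, N_FEATURES)
    theta = np.zeros(N_FEATURES)   # actor weights (mean of a Gaussian policy)
    w = np.zeros(N_FEATURES)       # critic weights (state-value estimate)
    SIGMA = 0.2                    # fixed exploration noise
    ALPHA_ACTOR, ALPHA_CRITIC, GAMMA = 0.01, 0.1, 0.95

    def features(state):
        """Radial-basis features over the position error (hypothetical encoding)."""
        return np.exp(-((state - centers) ** 2) / 0.1)

    def human_like_reward(error):
        """Stand-in for the human trainer: reward success, penalize large error."""
        return 1.0 if abs(error) < 0.05 else -abs(error)

    state = rng.uniform(-1, 1)  # initial position error
    for step in range(2000):
        phi = features(state)
        mu = theta @ phi
        action = mu + SIGMA * rng.normal()                        # Gaussian exploration
        next_state = state - 0.5 * action + 0.02 * rng.normal()   # toy arm dynamics
        r = human_like_reward(next_state)   # in the study, a person supplies this rating
        # TD(0) critic update and policy-gradient actor update
        delta = r + GAMMA * (w @ features(next_state)) - (w @ phi)
        w += ALPHA_CRITIC * delta * phi
        theta += ALPHA_ACTOR * delta * (action - mu) / SIGMA**2 * phi
        # reset the episode once the target is reached
        state = next_state if abs(next_state) > 0.05 else rng.uniform(-1, 1)

In the study itself, the reward at each evaluation came from a participant rating an animation of the simulated reaching movement, rather than from a hand-coded function as in this sketch.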


