A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning

机译：强化与模仿学习相结合的仿人特工

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Reinforcement learning (RL) builds an effective agent that handles tasks in complex and uncertain environments by maximizing future reward. However, the efficiency is insufficient for practical use such as game AI and autonomous driving. An effective but selfish agent conflicts with other humans, and hence the demand of a human-like behavior arises. Imitation learning (IL) has been employed to train an agent to mimic the actions of expert behaviors provided as training data. However, IL tends to build an agent limited in performance by the expert skill, and even worse, the agent exhibits an inconsistent behavior since IL is not goal-oriented. In this paper, we propose a training scheme by mixing RL and IL for both discrete and continuous action space problems. The proposed scheme builds an agent that achieves a performance higher than an agent trained by only IL and exhibits a more human-like behavior than agents trained by RL or IL, validated by human sensitivity.

机译：强化学习（RL）建立了一个有效的代理，可通过最大化未来回报来处理复杂而不确定的环境中的任务。但是，效率不足以用于诸如游戏AI和自动驾驶的实际使用。一个有效但自私的行为人与其他人发生冲突，因此产生了类似人的行为的需求。模仿学习（IL）已被用来训练代理，以模仿作为训练数据提供的专家行为。但是，IL倾向于构建受专家技能限制的性能的代理，甚至更糟糕的是，由于IL不是面向目标的，因此该代理表现出不一致的行为。在本文中，我们提出了一种针对离散和连续动作空间问题的混合RL和IL的训练方案。所提出的方案构建了一种性能比仅由IL训练的药剂更高的药剂，并且比由RL或IL训练的药剂表现出更像人的行为，这已通过人类敏感性验证。

著录项

来源
《International Joint Conference on Neural Networks》|2019年|1-8|共8页
会议地点
作者
Rousslan Fernand Julien Dossa; Xinyu Lian; Hirokazu Nomoto; Takashi Matsubara; Kuniaki Uehara;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Games; Reinforcement learning; Training data; Training; Neural networks; Task analysis; Autonomous vehicles;

机译：游戏;强化学习;训练数据;训练;神经网络;任务分析;自动驾驶汽车;

相似文献

外文文献
中文文献
专利

1. A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning [J] . Xinyu LIAN, Rousslan Fernand Julien DOSSA, Hirokazu NOMOTO, 電子情報通信学会技術研究報告. 複雑コミュニケーションサイエンス . 2018,第316期

机译：一种基于加强和仿制学习的混合的人类代理
2. Hybrid of Reinforcement and Imitation Learning for Human-Like Agents [J] . Rousslan F. J. DOSSA, Xinyu LIAN, Hirokazu NOMOTO, IEICE transactions on information and systems . 2020,第9期

机译：诸如人类代理的加固和模仿学习的混合
3. PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning [J] . Guillaume Sartoretti, Justin Kerr, Yunfei Shi, IEEE Robotics and Automation Letters . 2019,第3期

机译：主要：通过强化和模仿多智能体学习进行寻路
4. A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning [C] . Rousslan Fernand Julien Dossa, Xinyu Lian, Hirokazu Nomoto, International Joint Conference on Neural Networks . 2019

机译：一种基于泛削和仿制学习的混合的人类代理
5. Hybrid learning approach based on adaptive resonance theory and reinforcement learning for computer generated agents. [D] . Ninomiya, Susumu. 2002

机译：基于自适应共振理论和针对计算机生成的主体的强化学习的混合学习方法。
6. Learning for a Robot: Deep Reinforcement Learning Imitation Learning Transfer Learning [O] . Jiang Hua, Liangcai Zeng, Gongfa Li, 2021

机译：学习机器人：深增强学习仿制学习转移学习
7. Hybrid of Reinforcement and Imitation Learning for Human-Like Agents [O] . Rousslan F. J. DOSSA, Xinyu LIAN, Hirokazu NOMOTO, 2020

机译：诸如人类代理的加固和模仿学习的混合

A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning

摘要

著录项

相似文献

相关主题

期刊订阅