A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning

Xinyu LIAN; Rousslan Fernand Julien DOSSA; Hirokazu NOMOTO; Takashi MATSUBARA; Kuniaki UEHARA

首页> 外文期刊>電子情報通信学会技術研究報告. 複雑コミュニケーションサイエンス >A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning

【24h】

A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning

机译：一种基于加强和仿制学习的混合的人类代理

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Reinforcement learning (RL) makes it possible to build an efficient agent that handles tasks in complex and uncertain environments by maximizing future reward. However, for applications in some areas like game AI and autonomous driving, efficiency only cannot satisfy the practical use, and a human-like agent is preferable. On the other hand, in imitation learning (IL) tasks, which trains the agent to mimic actions of expert behavior provided as training data and thereby learns relatively complex tasks while achieving human-like behavior. Unfortunately, the performance of such an agent is generally limited by the expert behavior. Thus, with the aim of training an agent which achieves high performance while retaining a human-like behavior, we propose a method for mixing RL and IL, applicable to both discrete and continuous problems. We used state-of-the-art RL and IL algorithms and trained their respective models independently, before mixing them into the proposed hybrid model.

机译：强化学习（RL）使得可以通过最大化未来的奖励来构建一个有效的代理，这些代理可以在复杂和不确定的环境中处理任务。然而，对于游戏AI和自主驾驶等领域的应用，效率仅不能满足实际使用，并且优选人类代理。另一方面，在模仿学习（IL）任务中，该任务培训了代理以模仿作为训练数据提供的专家行为的动作，从而在实现人类的行为时学习相对复杂的任务。不幸的是，这种代理的性能通常受专家行为的限制。因此，随着培训在保持人类行为的同时实现高性能的药剂的目的，我们提出了一种混合R1和IL的方法，适用于离散和连续问题。我们使用最先进的RL和IL算法，并在将它们混合到所提出的混合模型之前，独立地培训了各自的模型。

著录项

来源
《電子情報通信学会技術研究報告. 複雑コミュニケーションサイエンス》 |2018年第316期|共6页
作者
Xinyu LIAN; Rousslan Fernand Julien DOSSA; Hirokazu NOMOTO; Takashi MATSUBARA; Kuniaki UEHARA;
展开▼
作者单位

Kobe University Graduate School of System Informatics;

Kobe University Graduate School of System Informatics;

EQUOS RESEARCH Co. Ltd.;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类通信;
关键词
Human-Like; Hybrid Model; Reinforcement Learning; Imitation Learning; Game AI; Autonomous Driving;

机译：人类;混合模型;加强学习;模仿学习;游戏AI;自主驾驶;

相似文献

外文文献
中文文献
专利

1. A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning [J] . Xinyu LIAN, Rousslan Fernand Julien DOSSA, Hirokazu NOMOTO, 電子情報通信学会技術研究報告. 複雑コミュニケーションサイエンス . 2018,第316期

机译：一种基于加强和仿制学习的混合的人类代理
2. Hybrid of Reinforcement and Imitation Learning for Human-Like Agents [J] . Rousslan F. J. DOSSA, Xinyu LIAN, Hirokazu NOMOTO, IEICE transactions on information and systems . 2020,第9期

机译：诸如人类代理的加固和模仿学习的混合
3. Adaptive multi-objective reinforcement learning with hybrid exploration for traffic signal control based on cooperative multi-agent framework [J] . Mohamed A. Khamis, Walid Gomaa Engineering Applications of Artificial Intelligence . 2014,第mara期

机译：基于合作多智能体框架的交通信号控制自适应多目标强化学习与混合探索
4. A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning [C] . Rousslan Fernand Julien Dossa, Xinyu Lian, Hirokazu Nomoto, International Joint Conference on Neural Networks . 2019

机译：强化与模仿学习相结合的仿人特工
5. Hybrid learning approach based on adaptive resonance theory and reinforcement learning for computer generated agents. [D] . Ninomiya, Susumu. 2002

机译：基于自适应共振理论和针对计算机生成的主体的强化学习的混合学习方法。
6. Learning for a Robot: Deep Reinforcement Learning Imitation Learning Transfer Learning [O] . Jiang Hua, Liangcai Zeng, Gongfa Li, 2021

机译：学习机器人：深增强学习仿制学习转移学习
7. Hybrid of Reinforcement and Imitation Learning for Human-Like Agents [O] . Rousslan F. J. DOSSA, Xinyu LIAN, Hirokazu NOMOTO, 2020

机译：诸如人类代理的加固和模仿学习的混合

A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning

摘要

著录项

相似文献

相关主题

期刊订阅