首页> 外文会议>International Joint Conference on Neural Networks >A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning
【24h】

A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning

机译:强化与模仿学习相结合的仿人特工

获取原文

摘要

Reinforcement learning (RL) builds an effective agent that handles tasks in complex and uncertain environments by maximizing future reward. However, the efficiency is insufficient for practical use such as game AI and autonomous driving. An effective but selfish agent conflicts with other humans, and hence the demand of a human-like behavior arises. Imitation learning (IL) has been employed to train an agent to mimic the actions of expert behaviors provided as training data. However, IL tends to build an agent limited in performance by the expert skill, and even worse, the agent exhibits an inconsistent behavior since IL is not goal-oriented. In this paper, we propose a training scheme by mixing RL and IL for both discrete and continuous action space problems. The proposed scheme builds an agent that achieves a performance higher than an agent trained by only IL and exhibits a more human-like behavior than agents trained by RL or IL, validated by human sensitivity.
机译:强化学习(RL)建立了一个有效的代理,可通过最大化未来回报来处理复杂而不确定的环境中的任务。但是,效率不足以用于诸如游戏AI和自动驾驶的实际使用。一个有效但自私的行为人与其他人发生冲突,因此产生了类似人的行为的需求。模仿学习(IL)已被用来训练代理,以模仿作为训练数据提供的专家行为。但是,IL倾向于构建受专家技能限制的性能的代理,甚至更糟糕的是,由于IL不是面向目标的,因此该代理表现出不一致的行为。在本文中,我们提出了一种针对离散和连续动作空间问题的混合RL和IL的训练方案。所提出的方案构建了一种性能比仅由IL训练的药剂更高的药剂,并且比由RL或IL训练的药剂表现出更像人的行为,这已通过人类敏感性验证。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号