首页> 外文会议>International Joint Conference on Neural Networks >A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning
【24h】

A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning

机译:一种基于泛削和仿制学习的混合的人类代理

获取原文

摘要

Reinforcement learning (RL) builds an effective agent that handles tasks in complex and uncertain environments by maximizing future reward. However, the efficiency is insufficient for practical use such as game AI and autonomous driving. An effective but selfish agent conflicts with other humans, and hence the demand of a human-like behavior arises. Imitation learning (IL) has been employed to train an agent to mimic the actions of expert behaviors provided as training data. However, IL tends to build an agent limited in performance by the expert skill, and even worse, the agent exhibits an inconsistent behavior since IL is not goal-oriented. In this paper, we propose a training scheme by mixing RL and IL for both discrete and continuous action space problems. The proposed scheme builds an agent that achieves a performance higher than an agent trained by only IL and exhibits a more human-like behavior than agents trained by RL or IL, validated by human sensitivity.
机译:强化学习(RL)通过最大化未来奖励来构建一个有效的代理,可以处理复杂和不确定环境中的任务。然而,效率不足以进行实际使用,例如游戏AI和自主驾驶。有效但自私的代理与其他人发生冲突,因此出现了人类的行为的需求。仿制学习(IL)已被用于培训代理人以模仿作为培训数据提供的专家行为的行为。然而,IL倾向于通过专家技能构建性能有限的代理,甚至更差,代理表现出不一致的行为,因为IL不是目标导向。在本文中,我们通过混合R1和IL来提出培训方案,以实现离散和连续的行动空间问题。该方案建立了一种代理商,该代理商实现了高于仅由IL培训的代理的性能,并且表现出比人类敏感性验证的RL或IL训练的药剂更具人类的行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号