首页> 外文期刊>電子情報通信学会技術研究報告. 複雑コミュニケーションサイエンス >A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning
【24h】

A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning

机译:一种基于加强和仿制学习的混合的人类代理

获取原文
获取原文并翻译 | 示例
           

摘要

Reinforcement learning (RL) makes it possible to build an efficient agent that handles tasks in complex and uncertain environments by maximizing future reward. However, for applications in some areas like game AI and autonomous driving, efficiency only cannot satisfy the practical use, and a human-like agent is preferable. On the other hand, in imitation learning (IL) tasks, which trains the agent to mimic actions of expert behavior provided as training data and thereby learns relatively complex tasks while achieving human-like behavior. Unfortunately, the performance of such an agent is generally limited by the expert behavior. Thus, with the aim of training an agent which achieves high performance while retaining a human-like behavior, we propose a method for mixing RL and IL, applicable to both discrete and continuous problems. We used state-of-the-art RL and IL algorithms and trained their respective models independently, before mixing them into the proposed hybrid model.
机译:强化学习(RL)使得可以通过最大化未来的奖励来构建一个有效的代理,这些代理可以在复杂和不确定的环境中处理任务。 然而,对于游戏AI和自主驾驶等领域的应用,效率仅不能满足实际使用,并且优选人类代理。 另一方面,在模仿学习(IL)任务中,该任务培训了代理以模仿作为训练数据提供的专家行为的动作,从而在实现人类的行为时学习相对复杂的任务。 不幸的是,这种代理的性能通常受专家行为的限制。 因此,随着培训在保持人类行为的同时实现高性能的药剂的目的,我们提出了一种混合R1和IL的方法,适用于离散和连续问题。 我们使用最先进的RL和IL算法,并在将它们混合到所提出的混合模型之前,独立地培训了各自的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号