首页> 外文会议>International Joint Conference on Artificial Intelligence >Using Cases as Heuristics in Reinforcement Learning: A Transfer Learning Application
【24h】

Using Cases as Heuristics in Reinforcement Learning: A Transfer Learning Application

机译:使用案例作为加固学习中的启发式信息:转移学习申请

获取原文

摘要

In this paper we propose to combine three AI techniques to speed up a Reinforcement Learning algorithm in a Transfer Learning problem: Case-based Reasoning, Heuristically Accelerated Reinforcement Learning and Neural Networks. To do so, we propose a new algorithm, called L3, which works in 3 stages: in the first stage, it uses Reinforcement Learning to learn how to perform one task, and stores the optimal policy for this problem as a case-base; in the second stage, it uses a Neural Network to map actions from one domain to actions in the other domain and; in the third stage, it uses the case-base learned in the first stage as heuristics to speed up the learning performance in a related, but different, task. The RL algorithm used in the first phase is the Q-learning and in the third phase is the recently proposed Case-based Heuristically Accelerated Q-learning. A set of empirical evaluations were conducted in transferring the learning between two domains, the Acrobot and the Robocup 3D: the policy learned during the solution of the Acrobot Problem is transferred and used to speed up the learning of stability policies for a humanoid robot in the Robocup 3D simulator. The results show that the use of this algorithm can lead to a significant improvement in the performance of the agent.
机译:在本文中,我们建议将三种AI技术结合起来在转移学习问题中加速加强学习算法:基于案例的推理,启发式加速的加强学习和神经网络。为此,我们提出了一种新的算法,称为L3,其工作在3个阶段:在第一阶段,它使用强化学习来学习如何执行一个任务,并将此问题的最佳策略存储在一个问题上;在第二阶段,它使用神经网络将来自一个域的动作映射到另一个域中的动作和;在第三阶段,它使用了在第一阶段中学到的案例基础作为启发式,以加快相关的学习性能,但不同但不同的任务。第一阶段中使用的RL算法是Q-Learning并且在第三阶段是最近提出的基于案例的启发式加速Q-Learning。在转移两个域之间的学习,Acrobot和Robocup 3D之间进行了一系列经验评估:转移了在解决icrobot问题的解决方案中学到的政策,并用于加快对人形机器人的稳定性政策的学习Robocup 3D模拟器。结果表明,使用该算法可能导致代理的性能显着改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号