首页> 外文会议>International joint conference on artificial intelligence;IJCAI-11 >Using Cases as Heuristics in Reinforcement Learning: A Transfer Learning Application
【24h】

Using Cases as Heuristics in Reinforcement Learning: A Transfer Learning Application

机译:在增强学习中将案例用作启发式方法:一种迁移学习应用程序

获取原文

摘要

In this paper we propose to combine three AI techniques to speed up a Reinforcement Learning algorithm in a Transfer Learning problem: Case-based Reasoning, Heuristically Accelerated Reinforcement Learning and Neural Networks. To do so, we propose a new algorithm, called L3, which works in 3 stages: in the first stage, it uses Reinforcement Learning to learn how to perform one task, and stores the optimal policy for this problem as a case-base; in the second stage, it uses a Neural Network to map actions from one domain to actions in the other domain and; in the third stage, it uses the case-base learned in the first stage as heuristics to speed up the learning performance in a related, but different, task. The RL algorithm used in the first phase is the Q-learning and in the third phase is the recently proposed Case-based Heuristically Accelerated Q-learning. A set of empirical evaluations were conducted in transferring the learning between two domains, the Acrobot and the Robocup 3D: the policy learned during the solution of the Acrobot Problem is transferred and used to speed up the learning of stability policies for a hu-manoid robot in the Robocup 3D simulator. The results show that the use of this algorithm can lead to a significant improvement in the performance of the agent.
机译:在本文中,我们建议结合三种AI技术来加快转移学习问题中的强化学习算法:基于案例的推理,启发式加速强化学习和神经网络。为此,我们提出了一种称为L3的新算法,该算法分3个阶段起作用:在第一阶段,它使用强化学习来学习如何执行一项任务,并将针对该问题的最佳策略存储为案例库;在第二阶段,它使用神经网络将一个域中的操作映射到另一域中的操作,并且;在第三阶段中,它将第一阶段中学习的案例库用作启发式方法,以加快相关但不同的任务中的学习性能。第一阶段使用的RL算法是Q学习,第三阶段使用的是最近提出的基于案例的启发式Q学习。在Acrobot和Robocup 3D这两个领域之间转移学习时,进行了一组实证评估:转移了Acrobot问题解决过程中所学到的策略,并用于加速对类人机器人的稳定性策略的学习。在Robocup 3D模拟器中。结果表明,使用此算法可以大大改善代理的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号