首页> 外文期刊>Artificial intelligence >Transferring knowledge as heuristics in reinforcement learning: A case-based approach
【24h】

Transferring knowledge as heuristics in reinforcement learning: A case-based approach

机译:在强化学习中以启发式方式传播知识:基于案例的方法

获取原文
获取原文并翻译 | 示例
       

摘要

The goal of this paper is to propose and analyse a transfer learning meta-algorithm that allows the implementation of distinct methods using heuristics to accelerate a Reinforcement Learning procedure in one domain (the target) that are obtained from another (simpler) domain (the source domain). This meta-algorithm works in three stages: first, it uses a Reinforcement Learning step to learn a task on the source domain, storing the knowledge thus obtained in a case base; second, it does an unsupervised mapping of the source-domain actions to the target-domain actions; and, third, the case base obtained in the first stage is used as heuristics to speed up the learning process in the target domain. A set of empirical evaluations were conducted in two target domains: the 3D mountain car (using a learned case base from a 2D simulation) and stability learning for a humanoid robot in the Robocup 3D Soccer Simulator (that uses knowledge learned from the Acrobot domain). The results attest that our transfer learning algorithm outperforms recent heuristically-accelerated reinforcement learning and transfer learning algorithms.
机译:本文的目的是提出并分析一种转移学习元算法,该算法允许使用启发式方法实施不同的方法,以加速从另一个(简单)域(源)获得的一个域(目标)中的强化学习过程。域)。这个元算法的工作分为三个阶段:首先,它使用强化学习步骤在源域上学习任务,并将由此获得的知识存储在案例库中。其次,它将源域操作映射到目标域操作;第三,将在第一阶段获得的案例库用作启发式方法,以加快目标域中的学习过程。在两个目标领域中进行了一组实证评估:3D山地车(使用来自2D模拟的学习案例库)和在Robocup 3D Soccer Simulator中使用人形机器人进行稳定性学习(使用从Acrobot领域学习到的知识) 。结果证明,我们的转移学习算法优于最近的启发式加速强化学习和转移学习算法。

著录项

  • 来源
    《Artificial intelligence》 |2015年第9期|102-121|共20页
  • 作者单位

    Centro Universitario da FEI, Av. Humberto de A.C. Branco, 3972, Sao Bernardo do Campo, Sao Paulo, Cep: 09850-901, Brazil;

    Universidade Federal do ABC (UFABC), Centro de Engenharia, Modelagem e Ciencias Sociais Aplicadas - CECS, Avenida dos Estados, 5001, Santo Andre, Sao Paulo, Cep: 09210-580, Brazil;

    Centro Universitario da FEI, Av. Humberto de A.C. Branco, 3972, Sao Bernardo do Campo, Sao Paulo, Cep: 09850-901, Brazil;

    Technological Institute of Aeronautics (ITA), Praca Marechal Eduardo Gomes, 50, S.J.C., Sao Paulo, Cep: 12.228-900, Brazil;

    IIIA - Artificial Intelligence Research Institute, CSIC - Spanish National Research Council, Campus Universitat Autonoma de Barcelona, 08193 Bellaterra, Catalonia, Spain;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Case-based reasoning; Reinforcement learning; Transfer learning;

    机译:基于案例的推理;强化学习;转移学习;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号