【24h】

Automatic Curriculum Graph Generation for Reinforcement Learning Agents

机译:加固学习代理的自动课程图形生成

获取原文

摘要

In recent years, research has shown that transfer learning methods can be leveraged to construct curricula that sequence a series of simpler tasks such that performance on a final target task is improved. A major limitation of existing approaches is that such curricula are handcrafted by humans that are typically domain experts. To address this limitation, we introduce a method to generate a curriculum based on task descriptors and a novel metric of transfer potential. Our method automatically generates a curriculum as a directed acyclic graph (as opposed to a linear sequence as done in existing work). Experiments in both discrete and continuous domains show that our method produces curricula that improve the agent's learning performance when compared to the baseline condition of learning on the target task from scratch.
机译:近年来,研究表明,可以利用转移学习方法来构建课程,该课程序列是一系列更简单的任务,使得在最终目标任务上的性能得到改善。 现有方法的一个主要限制是,这种课程由通常是域专家的人类手工制作。 为了解决这个限制,我们介绍一种基于任务描述符和传输潜力的新型度量来生成课程的方法。 我们的方法自动生成课程作为定向的非循环图(与在现有工作中完成的线性序列相反)。 离散和连续域中的实验表明,与从头划痕上学习的基线条件相比,我们的方法会产生提高代理的学习表现的课程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号