首页> 外文会议>International Joint Conference on Artificial Intelligence >Theoretically-Grounded Policy Advice from Multiple Teachers in Reinforcement Learning Settings with Applications to Negative Transfer
【24h】

Theoretically-Grounded Policy Advice from Multiple Teachers in Reinforcement Learning Settings with Applications to Negative Transfer

机译:来自多个教师的理论上接地的政策建议,并在加强学习环境中,应用于负转移

获取原文

摘要

Policy advice is a transfer learning method where a student agent is able to learn faster via advice from a teacher. However, both this and other reinforcement learning transfer methods have little theoretical analysis. This paper formally defines a setting where multiple teacher agents can provide advice to a student and introduces an algorithm to leverage both autonomous exploration and teacher's advice. Our regret bounds justify the intuition that good teachers help while bad teachers hurt. Using our formalization, we are also able to quantify, for the first time, when negative transfer can occur within such a reinforcement learning setting.
机译:政策建议是一项转移学习方法,学生代理能够通过教师的建议更快地学习。然而,这两个和其他加强学习转移方法都几乎没有理论分析。本文正式定义了多个教师代理商可以向学生提供建议并介绍一种算法,以利用自主探索和教师的建议。我们的遗憾界定了良好的教师帮助的直觉,而糟糕的教师受伤。使用我们的形式化,我们还能够在这种加强学习环境中发生负转移时第一次量化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号