首页> 外文会议>Conference on Neural Information Processing Systems >Adaptive Auxiliary Task Weighting for Reinforcement Learning
【24h】

Adaptive Auxiliary Task Weighting for Reinforcement Learning

机译:强化学习的自适应辅助任务加权

获取原文

摘要

Reinforcement learning is known to be sample inefficient, preventing its application to many real-world problems, especially with high dimensional observations like images. Transferring knowledge from other auxiliary tasks is a powerful tool for improving the learning efficiency. However, the usage of auxiliary tasks has been limited so far due to the difficulty in selecting and combining different auxiliary tasks. In this work, we propose a principled online learning algorithm that dynamically combines different auxiliary tasks to speed up training for reinforcement learning. Our method is based on the idea that auxiliary tasks should provide gradient directions that, in the long term, help to decrease the loss of the main task. We show in various environments that our algorithm can effectively combine a variety of different auxiliary tasks and achieves significant speedup compared to previous heuristic approaches of adapting auxiliary task weights.
机译:已知加强学习是样本效率低,防止其应用于许多现实世界问题,特别是具有像图像的高尺寸观测。 从其他辅助任务转移知识是一种提高学习效率的强大工具。 然而,到目前为止,辅助任务的使用是有限的,因为难以选择和结合不同的辅助任务。 在这项工作中,我们提出了一个原则的在线学习算法,它动态地结合了不同的辅助任务,以加速加强学习的培训。 我们的方法是基于辅助任务应提供梯度方向的想法,即长期帮助降低主要任务丢失。 我们在各种环境中展示了我们的算法可以有效地结合各种不同的辅助任务,并且与先前的调整辅助任务权重的启发式方法相比,实现了显着的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号