Adaptive Auxiliary Task Weighting for Reinforcement Learning


Abstract

Reinforcement learning is known to be sample inefficient, preventing its application to many real-world problems, especially those with high-dimensional observations such as images. Transferring knowledge from other auxiliary tasks is a powerful tool for improving learning efficiency. However, the use of auxiliary tasks has been limited so far because of the difficulty of selecting and combining different auxiliary tasks. In this work, we propose a principled online learning algorithm that dynamically combines different auxiliary tasks to speed up training for reinforcement learning. Our method is based on the idea that auxiliary tasks should provide gradient directions that, in the long term, help to decrease the loss of the main task. We show in various environments that our algorithm can effectively combine a variety of different auxiliary tasks and achieve significant speedup compared to previous heuristic approaches to adapting auxiliary task weights.
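
The gradient-direction idea in the abstract can be illustrated with a toy example. The sketch below is not the paper's algorithm; it assumes a simplified one-step rule in which each auxiliary weight moves in proportion to the dot product between the main-task gradient and that task's gradient. To first order, a parameter step along a weighted sum of auxiliary gradients lowers the main loss only through these dot products, so they are a natural short-horizon signal. The function name `update_aux_weights`, the parameter `meta_lr`, and the toy gradients are all hypothetical.

```python
def dot(u, v):
    """Plain dot product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))


def update_aux_weights(weights, main_grad, aux_grads, meta_lr=0.1):
    """Assumed one-step rule: raise the weight of each auxiliary task whose
    gradient aligns with the main-task gradient, lower it when they conflict."""
    return [w + meta_lr * dot(main_grad, g) for w, g in zip(weights, aux_grads)]


# Toy setup (an assumption for illustration): the main loss is
# 0.5 * ||theta||^2, so its gradient at theta is theta itself.
theta = [1.0, -2.0]
main_grad = theta

aux_grads = [
    [0.5 * x for x in theta],  # helpful task: same direction as the main gradient
    [-x for x in theta],       # harmful task: opposite direction
    [2.0, 1.0],                # unrelated task: orthogonal to the main gradient
]

weights = [1.0, 1.0, 1.0]
new_weights = update_aux_weights(weights, main_grad, aux_grads)
print(new_weights)
```

In this toy run the helpful task's weight rises, the harmful task's weight falls, and the orthogonal task's weight is unchanged. The abstract speaks of helping the main task "in the long term", so a faithful implementation would presumably look further ahead than this single-step signal.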


Bibliographic details

Source
Conference on Neural Information Processing Systems | 2020 | pp. 3969-4770 | 12 pages
Authors
Xingyu Lin; Harjatin Singh Baweja; George Kantor; David Held
Original format: PDF
Chinese Library Classification: Metrology

Similar literature

1. Multiple metric learning with query adaptive weights and multi-task re-weighting for person re-identification [J]. Jieru Jia, Qiuqi Ruan, Gaoyun An. Computer Vision and Image Understanding. 2017, issue jula.
2. Adaptive Task Offloading in Vehicular Edge Computing Networks: a Reinforcement Learning Based Scheme [J]. Zhang Jie, Guo Hongzhi, Liu Jiajia. Mobile Networks & Applications. 2020, issue 5.
3. Adaptive task scheduling in IoT using reinforcement learning [J]. Mohammad Khalid Pandit, Roohie Naaz Mir, Mohammad Ahsan Chishti. International Journal of Intelligent Computing and Cybernetics. 2020, issue 3.
4. Adaptive Auxiliary Task Weighting for Reinforcement Learning [C]. Xingyu Lin, Harjatin Singh Baweja, George Kantor, David Held. Conference on Neural Information Processing Systems. 2020.
5. Reinforcement Learning with Auxiliary Memory [D]. Suggs, Sterling. 2021.
6. Assisting Main Task Learning by Heterogeneous Auxiliary Tasks with Applications to Skin Cancer Screening [O]. Ning Situ, Xiaojing Yuan, George Zouridakis.
7. Fast Adaptive Task Offloading in Edge Computing Based on Meta Reinforcement Learning [O]. Jin Wang, Jia Hu, Geyong Min. 2021.
