DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

Irina Higgins; Arka Pal; Andrei Rusu; Loic Matthey; Christopher Burgess; Alexander Pritzel; Matthew Botvinick; Charles Blundell; Alexander Lerchner

首页> 外文期刊>JMLR: Workshop and Conference Proceedings >DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

【24h】

DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

机译：DARLA：加强强化学习中的零射转移

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Domain adaptation is an important open problem in deep reinforcement learning (RL). In many scenarios of interest data is hard to obtain, so agents may learn a source policy in a setting where data is readily available, with the hope that it generalises well to the target domain. We propose a new multi-stage RL agent, DARLA (DisentAngled Representation Learning Agent), which learns to see before learning to act. DARLA’s vision is based on learning a disentangled representation of the observed environment. Once DARLA can see, it is able to acquire source policies that are robust to many domain shifts – even with no access to the target domain. DARLA significantly outperforms conventional baselines in zero-shot domain adaptation scenarios, an effect that holds across a variety of RL environments (Jaco arm, DeepMind Lab) and base RL algorithms (DQN, A3C and EC).

机译：领域适应是深度强化学习（RL）中的一个重要的开放问题。在很多情况下，很难获得感兴趣的数据，因此代理可以在易于获得数据的环境中学习源策略，希望它能很好地推广到目标域。我们提出了一个新的多阶段RL代理DARLA（DisentAngled表示学习代理），该代理在学习行动之前先进行观察。 DARLA的愿景是基于对观察到的环境的清晰理解。一旦DARLA看到，它就能够获取对许多域转换都具有鲁棒性的源策略-即使没有访问目标域的权限。 DARLA在零击域自适应方案中明显优于传统基准，这种效果在各种RL环境（Jaco arm，DeepMind Lab）和基本RL算法（DQN，A3C和EC）中均有效。

著录项

来源
《JMLR: Workshop and Conference Proceedings》 |2017年第1期|共11页
作者
Irina Higgins; Arka Pal; Andrei Rusu; Loic Matthey; Christopher Burgess; Alexander Pritzel; Matthew Botvinick; Charles Blundell; Alexander Lerchner;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Using Task Descriptions in Lifelong Machine Learning for Improved Performance and Zero-Shot Transfer [J] . Mohammad Rostami, David Isele, Eric Eaton The Journal of Artificial Intelligence Research . 2020,第7期

机译：使用终身机器学习中的任务描述，提高性能和零拍摄传输
2. Transferrable Feature and Projection Learning with Class Hierarchy for Zero-Shot Learning [J] . Li Aoxue, Lu Zhiwu, Guan Jiechao, International Journal of Computer Vision . 2020,第12期

机译：零射击学习的类层次结构可转移特征和投影学习
3. Zero-shot policy generation in lifelong reinforcement learning [J] . Qian Yi-Ming, Xiong Fang-Zhou, Liu Zhi-Yong Neurocomputing . 2021,第Jula25期

机译：终身加固学习中的零射精政策生成
4. DARLA: Improving Zero-Shot Transfer in Reinforcement Learning [C] . Irina Higgins, Arka Pal, Andrei Rusu, International Conference on Machine Learning . 2018

机译：达拉：改善加固学习中的零射流
5. Improving Deep Reinforcement Learning Using Graph Convolution and Visual Domain Transfer [D] . Niu, Sufeng. 2018

机译：使用Graph卷积和视域传输改善深度钢筋学习
6. Learning for a Robot: Deep Reinforcement Learning Imitation Learning Transfer Learning [O] . Jiang Hua, Liangcai Zeng, Gongfa Li, 2021

机译：学习机器人：深增强学习仿制学习转移学习
7. Zero-shot Deep Reinforcement Learning Driving Policy Transfer for Autonomous Vehicles based on Robust Control [O] . Zhuo Xu, Chen Tang, Masayoshi Tomizuka 2018

机译：基于鲁棒控制的自主车辆零击零钢筋学习驾驶政策转移

DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅