...
首页> 外文期刊>JMLR: Workshop and Conference Proceedings >DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
【24h】

DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

机译:DARLA:加强强化学习中的零射转移

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Domain adaptation is an important open problem in deep reinforcement learning (RL). In many scenarios of interest data is hard to obtain, so agents may learn a source policy in a setting where data is readily available, with the hope that it generalises well to the target domain. We propose a new multi-stage RL agent, DARLA (DisentAngled Representation Learning Agent), which learns to see before learning to act. DARLA’s vision is based on learning a disentangled representation of the observed environment. Once DARLA can see, it is able to acquire source policies that are robust to many domain shifts – even with no access to the target domain. DARLA significantly outperforms conventional baselines in zero-shot domain adaptation scenarios, an effect that holds across a variety of RL environments (Jaco arm, DeepMind Lab) and base RL algorithms (DQN, A3C and EC).
机译:领域适应是深度强化学习(RL)中的一个重要的开放问题。在很多情况下,很难获得感兴趣的数据,因此代理可以在易于获得数据的环境中学习源策略,希望它能很好地推广到目标域。我们提出了一个新的多阶段RL代理DARLA(DisentAngled表示学习代理),该代理在学习行动之前先进行观察。 DARLA的愿景是基于对观察到的环境的清晰理解。一旦DARLA看到,它就能够获取对许多域转换都具有鲁棒性的源策略-即使没有访问目标域的权限。 DARLA在零击域自适应方案中明显优于传统基准,这种效果在各种RL环境(Jaco arm,DeepMind Lab)和基本RL算法(DQN,A3C和EC)中均有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号