首页> 外文会议>Conference on Neural Information Processing Systems >Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs
【24h】

Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs

机译:使用她和幻觉的GANS解决视觉任务中的样本复杂性

获取原文

摘要

Reinforcement Learning (RL) algorithms typically require millions of environment interactions to learn successful policies in sparse reward settings. Hindsight Experience Replay (HER) was introduced as a technique to increase sample efficiency by reimagining unsuccessful trajectories as successful ones by altering the originally intended goals. However, it cannot be directly applied to visual environments where goal states are often characterized by the presence of distinct visual features. In this work, we show how visual trajectories can be hallucinated to appear successful by altering agent observations using a generative model trained on relatively few snapshots of the goal. We then use this model in combination with HER to train RL agents in visual settings. We validate our approach on 3D navigation tasks and a simulated robotics application and show marked improvement over baselines derived from previous work.
机译:强化学习(RL)算法通常需要数百万个环境相互作用,以学习稀疏奖励设置中的成功策略。 后古经验重播(她)被引入作为通过改变最初预期的目标来提高不成功的轨迹来提高样本效率的技术。 然而,它不能直接应用于视觉环境,其中目标状态通常是在存在不同视觉特征的情况下的特征。 在这项工作中,我们展示了通过使用在目标相对较少的快照相对较少的快照的生成模型改变代理观测,如何效果令人难看的视觉轨迹。 然后,我们将此模型与她结合使用,以便在视觉设置中培训RL代理。 我们在3D导航任务和模拟机器人应用程序上验证了我们的方法,并显示了从以前的工作的基线显示出明显的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号