首页> 外文会议>IEEE International Conference on Robotics and Automation >Target-driven visual navigation in indoor scenes using deep reinforcement learning
【24h】

Target-driven visual navigation in indoor scenes using deep reinforcement learning

机译:使用深度强化学习在室内场景中以目标驱动的视觉导航

获取原文

摘要

Two less addressed issues of deep reinforcement learning are (1) lack of generalization capability to new goals, and (2) data inefficiency, i.e., the model requires several (and often costly) episodes of trial and error to converge, which makes it impractical to be applied to real-world scenarios. In this paper, we address these two issues and apply our model to target-driven visual navigation. To address the first issue, we propose an actor-critic model whose policy is a function of the goal as well as the current state, which allows better generalization. To address the second issue, we propose the AI2-THOR framework, which provides an environment with high-quality 3D scenes and a physics engine. Our framework enables agents to take actions and interact with objects. Hence, we can collect a huge number of training samples efficiently. We show that our proposed method (1) converges faster than the state-of-the-art deep reinforcement learning methods, (2) generalizes across targets and scenes, (3) generalizes to a real robot scenario with a small amount of fine-tuning (although the model is trained in simulation), (4) is end-to-end trainable and does not need feature engineering, feature matching between frames or 3D reconstruction of the environment.
机译:深度强化学习的两个未得到解决的问题是(1)缺乏对新目标的泛化能力,以及(2)数据效率低下,即该模型需要多次(且往往是昂贵的)试验和错误来收敛,这使其不切实际。应用于实际场景。在本文中,我们解决了这两个问题,并将我们的模型应用于目标驱动的视觉导航。为了解决第一个问题,我们提出了一个行为者评论模型,该模型的策略是目标以及当前状态的函数,可以更好地进行泛化。为了解决第二个问题,我们提出了AI2-THOR框架,该框架为环境提供了高质量的3D场景和物理引擎。我们的框架使代理能够采取行动并与对象进行交互。因此,我们可以有效地收集大量的训练样本。我们表明,我们提出的方法(1)的融合速度比最新的深度强化学习方法快;(2)可以跨目标和场景进行概括;(3)可以归纳为具有少量精细信息的真实机器人场景调整(尽管模型是在模拟中训练的),(4)是端到端可训练的,不需要要素工程,框架之间的要素匹配或环境的3D重建。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号