Target-driven visual navigation in indoor scenes using deep reinforcement learning

机译：使用深度强化学习在室内场景中以目标驱动的视觉导航

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Two less addressed issues of deep reinforcement learning are (1) lack of generalization capability to new goals, and (2) data inefficiency, i.e., the model requires several (and often costly) episodes of trial and error to converge, which makes it impractical to be applied to real-world scenarios. In this paper, we address these two issues and apply our model to target-driven visual navigation. To address the first issue, we propose an actor-critic model whose policy is a function of the goal as well as the current state, which allows better generalization. To address the second issue, we propose the AI2-THOR framework, which provides an environment with high-quality 3D scenes and a physics engine. Our framework enables agents to take actions and interact with objects. Hence, we can collect a huge number of training samples efficiently. We show that our proposed method (1) converges faster than the state-of-the-art deep reinforcement learning methods, (2) generalizes across targets and scenes, (3) generalizes to a real robot scenario with a small amount of fine-tuning (although the model is trained in simulation), (4) is end-to-end trainable and does not need feature engineering, feature matching between frames or 3D reconstruction of the environment.

机译：深度强化学习的两个未得到解决的问题是（1）缺乏对新目标的泛化能力，以及（2）数据效率低下，即该模型需要多次（且往往是昂贵的）试验和错误来收敛，这使其不切实际。应用于实际场景。在本文中，我们解决了这两个问题，并将我们的模型应用于目标驱动的视觉导航。为了解决第一个问题，我们提出了一个行为者评论模型，该模型的策略是目标以及当前状态的函数，可以更好地进行泛化。为了解决第二个问题，我们提出了AI2-THOR框架，该框架为环境提供了高质量的3D场景和物理引擎。我们的框架使代理能够采取行动并与对象进行交互。因此，我们可以有效地收集大量的训练样本。我们表明，我们提出的方法（1）的融合速度比最新的深度强化学习方法快;（2）可以跨目标和场景进行概括;（3）可以归纳为具有少量精细信息的真实机器人场景调整（尽管模型是在模拟中训练的），（4）是端到端可训练的，不需要要素工程，框架之间的要素匹配或环境的3D重建。

著录项

来源
《IEEE International Conference on Robotics and Automation》|2017年|3357-3364|共8页
会议地点
作者
Yuke Zhu; Roozbeh Mottaghi; Eric Kolve; Joseph J. Lim; Abhinav Gupta; Li Fei-Fei; Ali Farhadi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Navigation; Training; Visualization; Learning (artificial intelligence); Three-dimensional displays; Physics; Robots;

机译：导航;培训;可视化;学习（人工智能）;三维显示;物理;机器人;

相似文献

外文文献
中文文献
专利

1. Towards Generalization in Target-Driven Visual Navigation by Using Deep Reinforcement Learning [J] . Devo Alessandro, Mezzetti Giacomo, Costante Gabriele, IEEE Transactions on Robotics . 2020,第5期

机译：利用深增强学习，拓展目标驱动的视觉导航中的概括
2. Visual Navigation in Real-World Indoor Environments Using End-to-End Deep Reinforcement Learning [J] . Kulhanek Jonas, Derner Erik, Babuska Robert IEEE Robotics and Automation Letters . 2021,第3期

机译：现实世界室内环境中的视觉导航使用端到端的深度加强学习
3. Multi goals and multi scenes visual mapless navigation in indoor using meta-learning and scene priors [J] . Li Fei, Guo Chi, Luo Binhan, Neurocomputing . 2021,第Auga18期

机译：多目标和多场景在室内使用元学习和场景前瞻
4. Target-driven visual navigation in indoor scenes using deep reinforcement learning [C] . Yuke Zhu, Roozbeh Mottaghi, Eric Kolve, IEEE International Conference on Robotics and Automation . 2017

机译：使用深增强学习的室内场景中的目标驱动的视觉导航
5. UAV Navigation, Tracking, and Interception Using Deep Reinforcement Learning [D] . Darwish, Ali A. 2020

机译：UAV导航，跟踪和拦截使用深度加强学习
6. Decision-Making for the Autonomous Navigation of Maritime Autonomous Surface Ships Based on Scene Division and Deep Reinforcement Learning [O] . Xinyu Zhang, Chengbo Wang, Yuanchang Liu, 2019

机译：基于场景划分和深度强化学习的水面自主舰艇自主航行决策
7. Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning [O] . Zhu, Yuke, Mottaghi, Roozbeh, Kolve, Eric, 2016

机译：利用Deep实现室内场景中的目标驱动视觉导航强化学习

Target-driven visual navigation in indoor scenes using deep reinforcement learning

摘要

著录项

相似文献

相关主题

期刊订阅