首页> 外文会议>International Conference on Machine Learning >Hallucinative Topological Memory for Zero-Shot Visual Planning
【24h】

Hallucinative Topological Memory for Zero-Shot Visual Planning

机译:纯射击视觉规划的幻觉拓扑记忆

获取原文
获取外文期刊封面目录资料

摘要

In visual planning (VP), an agent learns to plan goal-directed behavior from observations of a dynamical system obtained offline, e.g., images obtained from self-supervised robot interaction. Most previous works on VP approached the problem by planning in a learned latent space, resulting in low-quality visual plans, and difficult training algorithms. Here, instead, we propose a simple VP method that plans directly in image space and displays competitive performance. We build on the semi-parametric topological memory (SPTM) method: image samples are treated as nodes in a graph, the graph connectivity is learned from image sequence data, and planning can be performed using conventional graph search methods. We propose two modifications on SPTM. First, we train an energy-based graph connectivity function using contrastive predictive coding that admits stable training. Second, to allow zero-shot planning in new domains, we learn a conditional VAE model that generates images given a context describing the domain, and use these hallucinated samples for building the connectivity graph and planning. We show that this simple approach significantly outperform the SOTA VP methods, in terms of both plan interpretability and success rate when using the plan to guide a trajectory-following controller. Interestingly, our method can pick up non-trivial visual properties of objects, such as their geometry, and account for it in the plans.
机译:在视觉规划(VP)中,代理学习从离线获得的动态系统的观察中规划目标定向行为,例如,从自我监督机器人交互获得的图像。在VP中最先前的工作通过规划在学习的潜在空间中接近了问题,导致低质量的视觉计划和艰难的训练算法。相反,我们提出了一种简单的VP方法,该方法直接在图像空间中计划并显示竞争性能。我们在半参数拓扑内存(SPTM)方法上构建:图像样本被视为曲线图中的节点,从图像序列数据中了解图形连接,并且可以使用传统的图形搜索方法执行规划。我们提出了两种对SPTM的修改。首先,我们使用承认稳定培训的对比预测编码训练基于能量的图形连接功能。其次,为了允许在新域中的零射击计划,我们学习一个条件VAE模型,以给定描述域的上下文,并使用这些幻觉样本来构建连接图和规划。我们表明,在使用计划以指导轨迹之后的控制器的计划中,这种简单的方法可以显着优于SOTA VP方法。有趣的是,我们的方法可以挑选对象的非琐碎的视觉属性,例如它们的几何形状,并在计划中占据它。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号