Hallucinative Topological Memory for Zero-Shot Visual Planning

机译：纯射击视觉规划的幻觉拓扑记忆

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In visual planning (VP), an agent learns to plan goal-directed behavior from observations of a dynamical system obtained offline, e.g., images obtained from self-supervised robot interaction. Most previous works on VP approached the problem by planning in a learned latent space, resulting in low-quality visual plans, and difficult training algorithms. Here, instead, we propose a simple VP method that plans directly in image space and displays competitive performance. We build on the semi-parametric topological memory (SPTM) method: image samples are treated as nodes in a graph, the graph connectivity is learned from image sequence data, and planning can be performed using conventional graph search methods. We propose two modifications on SPTM. First, we train an energy-based graph connectivity function using contrastive predictive coding that admits stable training. Second, to allow zero-shot planning in new domains, we learn a conditional VAE model that generates images given a context describing the domain, and use these hallucinated samples for building the connectivity graph and planning. We show that this simple approach significantly outperform the SOTA VP methods, in terms of both plan interpretability and success rate when using the plan to guide a trajectory-following controller. Interestingly, our method can pick up non-trivial visual properties of objects, such as their geometry, and account for it in the plans.

机译：在视觉规划（VP）中，代理学习从离线获得的动态系统的观察中规划目标定向行为，例如，从自我监督机器人交互获得的图像。在VP中最先前的工作通过规划在学习的潜在空间中接近了问题，导致低质量的视觉计划和艰难的训练算法。相反，我们提出了一种简单的VP方法，该方法直接在图像空间中计划并显示竞争性能。我们在半参数拓扑内存（SPTM）方法上构建：图像样本被视为曲线图中的节点，从图像序列数据中了解图形连接，并且可以使用传统的图形搜索方法执行规划。我们提出了两种对SPTM的修改。首先，我们使用承认稳定培训的对比预测编码训练基于能量的图形连接功能。其次，为了允许在新域中的零射击计划，我们学习一个条件VAE模型，以给定描述域的上下文，并使用这些幻觉样本来构建连接图和规划。我们表明，在使用计划以指导轨迹之后的控制器的计划中，这种简单的方法可以显着优于SOTA VP方法。有趣的是，我们的方法可以挑选对象的非琐碎的视觉属性，例如它们的几何形状，并在计划中占据它。

著录项

来源
《International Conference on Machine Learning》|2021年|6215-7000p|共12页
会议地点
作者
Kara Liu; Thanard Kurutach; Christine Tung; Pieter Abbeel; Aviv Tamar;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP181-53;
关键词

相似文献

外文文献
中文文献
专利

1. Hallucinating the past: A case of spontaneous and involuntary recall of long-term memories Perspectives on the hemispheric organization of visual memory. [J] . Faber KM, Johnson LN Journal of neurology . 2003,第1期

机译：幻觉过去：长期记忆的自发性和非自愿性回忆案例关于视觉记忆的半球组织的观点。
2. Decision theory, motor planning, and visual memory: deciding where to reach when memory errors are costly [J] . Lerch Rachel A., Sims Chris R. Experimental Brain Research . 2016,第6期

机译：决策理论，运动计划和视觉记忆：决定记忆错误代价高昂时应到达的位置
3. Neural coding of action planning: visual processing or visual memory? [J] . Simona Monaco, Elisa Pellencin, Malfatti Giulia, Journal of vision . 2016,第12期

机译：行动计划的神经编码：视觉处理还是视觉记忆？
4. Hallucinative Topological Memory for Zero-Shot Visual Planning [C] . Kara Liu, Thanard Kurutach, Christine Tung, International Conference on Machine Learning . 2021

机译：纯射击视觉规划的幻觉拓扑记忆
5. Zero-Shot Visual Recognition via Latent Embedding Learning [D] . Wang, Qian. 2018

机译：潜在嵌入学习的零发视觉识别
6. Action Planning Mediates Guidance of Visual Attention from Working Memory [O] . Tobias Feldmann-Wüstefeld, Anna Schubö 2015

机译：行动计划从工作记忆中调解视觉注意力的指导
7. Topological Analysis and Visualization of Cyclical Behavior in Memory Reference Traces [O] . A. N. M. Imroz, Choudhury Bei, Wang Paul, 2016

机译：记忆参考轨迹中循环行为的拓扑分析与可视化

Hallucinative Topological Memory for Zero-Shot Visual Planning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅