IEEE Transactions on Pattern Analysis and Machine Intelligence

Real-Time RGB-D Camera Pose Estimation in Novel Scenes Using a Relocalisation Cascade



Abstract

Camera pose estimation is an important problem in computer vision, with applications as diverse as simultaneous localisation and mapping, virtual/augmented reality and navigation. Common techniques match the current image against keyframes with known poses coming from a tracker, directly regress the pose, or establish correspondences between keypoints in the current image and points in the scene in order to estimate the pose. In recent years, regression forests have become a popular alternative to establish such correspondences. They achieve accurate results, but have traditionally needed to be trained offline on the target scene, preventing relocalisation in new environments. Recently, we showed how to circumvent this limitation by adapting a pre-trained forest to a new scene on the fly. The adapted forests achieved relocalisation performance that was on par with that of offline forests, and our approach was able to estimate the camera pose in close to real time, which made it desirable for systems that require online relocalisation. In this paper, we present an extension of this work that achieves significantly better relocalisation performance whilst running fully in real time. To achieve this, we make several changes to the original approach: (i) instead of simply accepting the camera pose hypothesis produced by RANSAC without question, we make it possible to score the final few hypotheses it considers using a geometric approach and select the most promising one; (ii) we chain several instantiations of our relocaliser (with different parameter settings) together in a cascade, allowing us to try faster but less accurate relocalisation first, only falling back to slower, more accurate relocalisation as necessary; and (iii) we tune the parameters of our cascade, and the individual relocalisers it contains, to achieve effective overall performance. Taken together, these changes allow us to significantly improve upon the performance our original state-of-the-art method was able to achieve on the well-known 7-Scenes and Stanford 4 Scenes benchmarks. As additional contributions, we present a novel way of visualising the internal behaviour of our forests, and use the insights gleaned from this to show how to entirely circumvent the need to pre-train a forest on a generic scene.
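To make the cascade idea in the abstract concrete, here is a minimal sketch of the two mechanisms it describes: (i) geometrically scoring the final few RANSAC pose hypotheses and keeping the best, and (ii) trying a fast relocaliser first and only falling back to slower, more accurate ones when needed. All class names, fields and thresholds below are hypothetical illustrations for this sketch, not the authors' actual implementation or API.

```python
# A minimal sketch of a relocalisation cascade (hypothetical interfaces).

from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class PoseHypothesis:
    pose: object            # e.g. a 4x4 rigid-body transform (placeholder type)
    geometric_score: float  # lower = better agreement with the current depth image


@dataclass
class Relocaliser:
    name: str
    estimate: Callable[[object], List[PoseHypothesis]]  # returns the final RANSAC hypotheses
    score_threshold: float  # accept a pose only if its score is at or below this


def relocalise_with_cascade(frame: object,
                            cascade: List[Relocaliser]) -> Optional[PoseHypothesis]:
    """Try fast relocalisers first, falling back to slower, more accurate ones."""
    for relocaliser in cascade:
        # (i) Score the final few hypotheses geometrically and keep the best,
        #     rather than blindly accepting the single pose RANSAC would return.
        hypotheses = relocaliser.estimate(frame)
        if not hypotheses:
            continue
        best = min(hypotheses, key=lambda h: h.geometric_score)

        # (ii) Only fall back to the next (slower, more accurate) relocaliser
        #      if the best hypothesis from this stage is not good enough.
        if best.geometric_score <= relocaliser.score_threshold:
            return best
    return None  # relocalisation failed for this frame
```

In this sketch, each cascade stage corresponds to one instantiation of the relocaliser with its own parameter settings; tuning those settings and the per-stage acceptance thresholds is what the paper refers to as tuning the cascade for effective overall performance.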


