International Conference on Computer Vision

Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry Towards Monocular Deep SLAM

Abstract

In this paper we tackle the joint learning problem of keyframe detection and visual odometry towards monocular visual SLAM systems. As an important task in visual SLAM, keyframe selection enables efficient camera relocalization and effectively augments visual odometry. To benefit from it, we first present a deep network design for keyframe selection, which is able to reliably detect keyframes and localize new frames; an end-to-end unsupervised deep framework is then further proposed for simultaneously learning the keyframe selection and visual odometry tasks. As far as we know, this is the first work to jointly optimize these two complementary tasks in a single deep framework. To make the two tasks facilitate each other during learning, a collaborative optimization loss based on both geometric and visual metrics is proposed. Extensive experiments on publicly available datasets (i.e., the KITTI raw dataset and its odometry split) clearly demonstrate the effectiveness of the proposed approach, and new state-of-the-art results are established for unsupervised depth and pose estimation from monocular videos.
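The abstract does not spell out the collaborative optimization loss. The sketch below is only a rough illustration of the general idea it describes: a visual (photometric) term and a geometric (depth-consistency) term combined per frame pair and modulated by a predicted keyframe confidence, so that the keyframe-selection and visual-odometry branches influence each other during training. All function names, the consistency formulation, and the weighting scheme are assumptions for illustration, not the authors' implementation.

```python
import torch


def photometric_loss(target, warped):
    # Visual metric: mean absolute difference between the target frame and the
    # source frame warped into the target view via predicted depth and pose.
    return (target - warped).abs().mean()


def geometric_loss(depth_target, depth_warped):
    # Geometric metric: normalized consistency between the target depth map and
    # the source depth projected into the target view.
    return ((depth_target - depth_warped).abs()
            / (depth_target + depth_warped + 1e-7)).mean()


def collaborative_loss(target, warped, depth_target, depth_warped,
                       keyframe_score, w_photo=1.0, w_geo=0.5):
    # keyframe_score in [0, 1] comes from the keyframe-selection branch and
    # controls how strongly this frame pair constrains the odometry branch.
    per_pair = (w_photo * photometric_loss(target, warped)
                + w_geo * geometric_loss(depth_target, depth_warped))
    return keyframe_score * per_pair


if __name__ == "__main__":
    # Toy usage with random tensors standing in for network outputs.
    t = torch.rand(1, 3, 128, 416)        # target frame
    w = torch.rand(1, 3, 128, 416)        # source frame warped into the target view
    dt = torch.rand(1, 1, 128, 416) + 0.1  # predicted target depth
    dw = torch.rand(1, 1, 128, 416) + 0.1  # projected source depth
    score = torch.tensor(0.8)              # assumed keyframe confidence
    print(collaborative_loss(t, w, dt, dw, score))
```

In such a scheme the two tasks couple through the shared loss: frame pairs judged as keyframes contribute more to the pose and depth gradients, while the quality of the warping in turn informs which frames are worth selecting.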
