Future Internet

Occlusion-Aware Unsupervised Learning of Monocular Depth, Optical Flow and Camera Pose with Geometric Constraints

Abstract

We present an occlusion-aware unsupervised neural network for jointly learning three low-level vision tasks from monocular videos: depth, optical flow, and camera motion. The system consists of three different predicting sub-networks that are coupled by combined loss terms during training, yet it can compute each task independently on test samples. Geometric constraints extracted from scene geometry, which have traditionally been used in bundle adjustment or pose-graph optimization, are formulated as various self-supervisory signals in our end-to-end learning approach. Unlike prior works, our image reconstruction loss also takes optical flow into account. Moreover, we impose novel 3D flow consistency constraints over the predictions of all three tasks. By explicitly modeling occlusion and utilizing both 2D and 3D geometric relationships, abundant geometric constraints are formed over the estimated outputs, enabling the system to capture both low-level representations and high-level cues for inferring thin scene structures. Empirical evaluation on the KITTI dataset demonstrates the effectiveness and improvements of our approach: (1) monocular depth estimation outperforms state-of-the-art unsupervised methods and is comparable to stereo-supervised ones; (2) optical flow prediction ranks top among prior works and even beats supervised and traditional methods, especially in non-occluded regions; (3) pose estimation outperforms established SLAM systems under comparable input settings by a reasonable margin.
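To make the geometric coupling concrete, the sketch below shows one simplified way such a constraint can be formed: predicted depth and camera pose are combined into a rigid flow field, which is then compared against the separately predicted optical flow in non-occluded regions only. This is a minimal PyTorch sketch written for this summary rather than the authors' released code; the function names, tensor shapes, and the externally supplied visibility mask are assumptions, and the paper's actual consistency term is defined over 3D flow rather than this 2D simplification.

```python
import torch


def rigid_flow_from_depth_pose(depth, pose, K):
    """Flow induced purely by camera motion (hypothetical helper,
    assuming a pinhole camera model).

    depth: (B, 1, H, W) predicted depth of the source frame
    pose:  (B, 3, 4)    predicted relative camera pose [R | t]
    K:     (B, 3, 3)    camera intrinsics
    returns a (B, 2, H, W) rigid flow field
    """
    B, _, H, W = depth.shape
    dtype, device = depth.dtype, depth.device

    # Pixel grid in homogeneous coordinates, shape (B, 3, H*W).
    ys, xs = torch.meshgrid(
        torch.arange(H, dtype=dtype, device=device),
        torch.arange(W, dtype=dtype, device=device),
        indexing="ij",
    )
    pix = torch.stack([xs, ys, torch.ones_like(xs)], dim=0)
    pix = pix.reshape(1, 3, -1).expand(B, -1, -1)

    # Back-project pixels to 3D, apply the predicted camera motion, re-project.
    cam = torch.linalg.inv(K) @ pix * depth.reshape(B, 1, -1)
    cam_h = torch.cat([cam, torch.ones(B, 1, H * W, dtype=dtype, device=device)], dim=1)
    proj = K @ (pose @ cam_h)
    uv = proj[:, :2] / proj[:, 2:3].clamp(min=1e-6)

    # Rigid flow is the displacement between re-projected and original pixels.
    return (uv - pix[:, :2]).reshape(B, 2, H, W)


def flow_consistency_loss(pred_flow, rigid_flow, visibility_mask):
    """Mean L1 disagreement between predicted and rigid flow,
    evaluated only where visibility_mask marks pixels as non-occluded."""
    diff = (pred_flow - rigid_flow).abs() * visibility_mask
    return diff.sum() / visibility_mask.sum().clamp(min=1.0)
```

In training, analogous terms can be attached to the outputs of all three sub-networks so that the combined loss couples depth, flow, and pose through the shared geometry, while each sub-network still runs on its own at test time.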
