High resolution depth reconstruction from monocular images and sparse point clouds using deep convolutional neural network

Conference: Unconventional and Indirect Imaging, Image Reconstruction, and Wavefront Sensing 2017


Abstract

Understanding the 3D structure of the environment is advantageous for many tasks in the field of robotics and autonomous vehicles. From the robot's point of view, 3D perception is often formulated as a depth image reconstruction problem. In the literature, dense depth images are often recovered deterministically from stereo image disparities. Other systems use an expensive LiDAR sensor to produce accurate but semi-sparse depth images. With the advent of deep learning, there have also been attempts to estimate depth using only monocular images. In this paper we combine the best of the two worlds, focusing on a combination of monocular images and low-cost LiDAR point clouds. We explore the idea that very sparse depth information accurately captures the global scene structure, while variations in image patches can be used to reconstruct local depth at high resolution. The main contribution of this paper is a supervised-learning depth reconstruction system based on a deep convolutional neural network. The network is trained on RGB image patches reinforced with sparse depth information, and the output is a depth estimate for each pixel. Using image and point cloud data from the KITTI vision dataset, we are able to learn a correspondence between local RGB information and local depth while preserving the global scene structure. Our results are evaluated on sequences from the KITTI dataset and on our own recordings made with a low-cost camera and LiDAR setup.
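
The abstract does not give the network architecture, so the following is only a minimal sketch of the stated idea: concatenate an RGB patch with a sparse LiDAR depth channel and regress a per-pixel depth map with a convolutional encoder-decoder trained under direct supervision. The class name SparseDepthFusionNet, the layer widths, and the 64x64 patch size are illustrative assumptions, not details from the paper.

```python
import torch
import torch.nn as nn

class SparseDepthFusionNet(nn.Module):
    """Hypothetical encoder-decoder CNN: RGB patch + sparse depth channel -> dense depth."""
    def __init__(self):
        super().__init__()
        # 4 input channels: RGB plus a sparse depth map (zeros where no LiDAR return exists)
        self.encoder = nn.Sequential(
            nn.Conv2d(4, 32, kernel_size=3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 64, kernel_size=3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 128, kernel_size=3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(128, 64, kernel_size=4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(64, 32, kernel_size=4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(32, 1, kernel_size=4, stride=2, padding=1),  # single depth channel
        )

    def forward(self, rgb, sparse_depth):
        # Fuse image and sparse depth by channel-wise concatenation.
        x = torch.cat([rgb, sparse_depth], dim=1)
        return self.decoder(self.encoder(x))

if __name__ == "__main__":
    net = SparseDepthFusionNet()
    rgb = torch.rand(1, 3, 64, 64)                      # RGB image patch
    sparse = torch.zeros(1, 1, 64, 64)                  # mostly empty depth channel
    sparse[0, 0, ::8, ::8] = torch.rand(8, 8) * 80.0    # a few simulated LiDAR returns (metres)
    pred = net(rgb, sparse)
    print(pred.shape)  # torch.Size([1, 1, 64, 64]) -- one depth estimate per pixel
    # Supervised training would minimise e.g. an L1 loss against dense ground-truth depth.
    target = torch.rand(1, 1, 64, 64) * 80.0
    loss = nn.functional.l1_loss(pred, target)
    print(loss.item())
```

In this sketch, zeros in the sparse channel mark pixels without a LiDAR return; the assumption is that the network learns to propagate the few valid measurements (global structure) using local RGB texture, which mirrors the intuition described in the abstract.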
