
DEEP LEARNING FOR MONOCULAR DEPTH ESTIMATION FROM UAV IMAGES


Abstract

Depth is an essential component of various scene-understanding tasks and of reconstructing the 3D geometry of a scene. Estimating depth from stereo images requires capturing multiple views of the same scene, which is often not possible when exploring new environments with a UAV. To overcome this, monocular depth estimation has become a topic of interest alongside recent advances in computer vision and deep learning. Research in this area has focused largely on indoor scenes or on outdoor scenes captured at ground level. Single-image depth estimation from aerial images has been limited by the additional complexities of increased camera distance, wider area coverage, and numerous occlusions. A new aerial image dataset is prepared specifically for this purpose, combining Unmanned Aerial Vehicle (UAV) images covering different regions, features, and points of view. The single-image depth estimation is based on image reconstruction techniques, which use stereo images to learn to estimate depth from single images. Among the models available for ground-level single-image depth estimation, two are used to learn depth from UAV aerial images: 1) a Convolutional Neural Network (CNN) and 2) a Generative Adversarial Network (GAN). These models generate pixel-wise disparity images, which can be converted into depth information. The disparity maps generated by these models are evaluated for internal quality using various error metrics. The results show that the CNN model produces smoother images with a higher disparity range, while the GAN model produces sharper images with a smaller disparity range. The produced disparity images are converted to depth information and compared with point clouds obtained using Pix4D. The CNN model is found to perform better than the GAN and to produce depth similar to that of Pix4D. This comparison helps streamline efforts to produce depth from a single aerial image.
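The abstract notes that the pixel-wise disparity maps can be converted into depth information. A minimal sketch of the standard stereo conversion, depth = focal length × baseline / disparity, is below; the focal length and baseline values are illustrative assumptions, not parameters from the paper.

```python
import numpy as np

def disparity_to_depth(disparity, focal_length_px, baseline_m, eps=1e-6):
    """Convert a pixel-wise disparity map (pixels) to metric depth (meters).

    Uses depth = focal_length * baseline / disparity. Pixels with
    (near-)zero disparity correspond to points at effectively infinite
    distance and are set to inf to avoid division by zero.
    """
    disparity = np.asarray(disparity, dtype=np.float64)
    depth = np.full_like(disparity, np.inf)
    valid = disparity > eps
    depth[valid] = focal_length_px * baseline_m / disparity[valid]
    return depth

# Illustrative values: 600 px focal length, 0.5 m stereo baseline
disp = np.array([[10.0, 20.0],
                 [0.0,  5.0]])
depth = disparity_to_depth(disp, focal_length_px=600.0, baseline_m=0.5)
# depth[0, 0] = 600 * 0.5 / 10 = 30.0 m; zero disparity maps to inf
```

Because disparity appears in the denominator, small disparity errors translate into large depth errors for distant points, which is one reason the disparity-range differences between the CNN and GAN outputs matter when comparing against Pix4D point clouds.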