Pattern Recognition: The Journal of the Pattern Recognition Society

Monocular depth estimation with hierarchical fusion of dilated CNNs and soft-weighted-sum inference



Abstract

Monocular depth estimation is very challenging in complex compositions depicting multiple objects of diverse scales. Despite the recent great progress brought by deep convolutional neural networks, state-of-the-art monocular depth estimation methods still fall short of handling such challenging real-world scenarios. In this paper, we propose a deep end-to-end learning framework to tackle these challenges, which learns a direct mapping from a color image to the corresponding depth map. First, we formulate monocular depth estimation as a multi-category dense labeling task, in contrast to the common regression-based formulation. In this way, we can build upon recent progress in dense labeling tasks such as semantic segmentation. Second, we fuse the different side-outputs of our front-end dilated convolutional neural network in a hierarchical way to exploit multi-scale depth cues, which is critical for scale-aware depth estimation. Third, we propose soft-weighted-sum inference in place of hard-max inference, transforming the discretized depth scores into continuous depth values; this reduces the influence of quantization error and improves the robustness of our method. Extensive experiments have been conducted on the Make3D, NYU v2, and KITTI datasets, and our method achieves superior performance on NYU v2 and KITTI compared with current state-of-the-art methods. Furthermore, experiments on the NYU v2 dataset reveal that our classification-based model is able to learn the probability distribution of depth. (C) 2018 Elsevier Ltd. All rights reserved.
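To illustrate the third point, the minimal sketch below shows how soft-weighted-sum inference can convert per-pixel depth-bin scores from a classification network into continuous depth values, compared with hard-max inference. The bin count, the depth range, and the log-space discretization are assumptions chosen for illustration, not values taken from the paper.

```python
import numpy as np

def soft_weighted_sum_depth(scores, bin_centers):
    """scores: (H, W, K) per-pixel depth-bin scores (logits).
    bin_centers: (K,) representative (log-)depth of each bin.
    Returns an (H, W) continuous depth map."""
    # Softmax over the bin dimension yields a per-pixel probability distribution.
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    probs = np.exp(scores)
    probs /= probs.sum(axis=-1, keepdims=True)
    # Expected log-depth under that distribution, mapped back to metric depth.
    log_depth = (probs * bin_centers).sum(axis=-1)
    return np.exp(log_depth)

def hard_max_depth(scores, bin_centers):
    """Baseline hard-max inference: take the single highest-scoring bin."""
    return np.exp(bin_centers[scores.argmax(axis=-1)])

# Assumed setup: K = 80 bins spanning 0.7 m to 10 m, uniform in log space.
K = 80
bin_centers = np.linspace(np.log(0.7), np.log(10.0), K)
scores = np.random.randn(4, 4, K)  # stand-in for the network's side-output fusion
print(soft_weighted_sum_depth(scores, bin_centers).shape)  # (4, 4)
```

Because the weighted sum averages over neighboring bins according to their probabilities, the predicted depth is no longer restricted to the discrete bin centers, which is what mitigates the quantization error mentioned in the abstract.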
