Structure-Guided Ranking Loss for Single Image Depth Prediction

机译：结构指导的单图像深度预测排名损失

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Single image depth prediction is a challenging task due to its ill-posed nature and challenges with capturing ground truth for supervision. Large-scale disparity data generated from stereo photos and 3D videos is a promising source of supervision, however, such disparity data can only approximate the inverse ground truth depth up to an affine transformation. To more effectively learn from such pseudo-depth data, we propose to use a simple pair-wise ranking loss with a novel sampling strategy. Instead of randomly sampling point pairs, we guide the sampling to better characterize structure of important regions based on the low-level edge maps and high-level object instance masks. We show that the pair-wise ranking loss, combined with our structure-guided sampling strategies, can significantly improve the quality of depth map prediction. In addition, we introduce a new relative depth dataset of about 21K diverse high-resolution web stereo photos to enhance the generalization ability of our model. In experiments, we conduct cross-dataset evaluation on six benchmark datasets and show that our method consistently improves over the baselines, leading to superior quantitative and qualitative results.

机译：单一图像深度预测由于其不适当的性质以及捕获地面实物进行监督所面临的挑战，是一项具有挑战性的任务。从立体照片和3D视频生成的大规模视差数据是有希望的监管来源，但是，此类视差数据只能近似逆地面真实深度，直到进行仿射变换。为了更有效地从此类伪深度数据中学习，我们建议使用简单的成对排名损失和新颖的采样策略。代替随机采样点对，我们指导采样以基于低级边缘图和高级对象实例蒙版更好地表征重要区域的结构。我们表明，成对排名损失与我们的结构指导采样策略相结合，可以显着提高深度图预测的质量。此外，我们引入了约21K张各种高分辨率网络立体照片的新的相对深度数据集，以增强模型的泛化能力。在实验中，我们对六个基准数据集进行了跨数据集评估，结果表明，我们的方法在基线之上持续改进，从而带来了出色的定量和定性结果。

著录项

来源
《IEEE/CVF Conference on Computer Vision and Pattern Recognition》|2020年|608-617|共10页
会议地点
作者
Ke Xian; Jianming Zhang; Oliver Wang; Long Mai; Zhe Lin; Zhiguo Cao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Image edge detection; Training; Three-dimensional displays; Sensors; Task analysis; Videos; Measurement;

机译：图像边缘检测;训练;三维显示;传感器;任务分析;视频;测量;

相似文献

外文文献
中文文献
专利

1. Depth map prediction from a single image with generative adversarial nets [J] . Shaoyong Zhang, Na Li, Chenchen Qiu, Multimedia Tools and Applications . 2020,第21a22期

机译：来自单个图像的深度地图预测，具有生成的对抗网
2. A Unified Framework for Depth Prediction from a Single Image and Binocular Stereo Matching [J] . International journal of applied mechanics . 2020,第3期

机译：从单个图像和双目立体匹配的深度预测统一框架
3. Peeking behind objects: Layered depth prediction from a single image [J] . Dhamo Helisa, Tateno Keisuke, Laina Iro, Pattern recognition letters . 2019,第JULa期

机译：在对象后面偷看：从单个图像进行分层深度预测
4. Shading Structure-Guided Depth Image Restoration [C] . Xiuxiu Li, Haiyan Jin, Yanjuan Liu, International conference on brain-inspired cognitive systems . 2018

机译：阴影结构引导的深度图像还原
5. Convolutional neural network based age estimation from facial image and depth prediction from single image. [D] . Qiu, Jiayan. 2016

机译：基于卷积神经网络的基于面部图像的年龄估计和基于单个图像的深度预测。
6. Registered Relief Depth (RRD) borobudur dataset for single-frame depth prediction on one-side artifacts [O] . Aufaclav Zatu Kusuma Frisky, Agus Harjoko, Lukman Awaludin, 2021

机译：用于单帧文物的单帧深度预测的注册浮雕深度（RRD）Borobudur数据集
7. Sparse-to-Dense: Depth Prediction from Sparse Depth Samples and a Single Image [O] . Ma, Fangchang, Karaman, Sertac 2017

机译：稀疏到密集：稀疏深度样本和单个样本的深度预测图片

Structure-Guided Ranking Loss for Single Image Depth Prediction

摘要

著录项

相似文献

相关主题

期刊订阅