首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >Structure-Guided Ranking Loss for Single Image Depth Prediction
【24h】

Structure-Guided Ranking Loss for Single Image Depth Prediction

机译:结构指导的单图像深度预测排名损失

获取原文

摘要

Single image depth prediction is a challenging task due to its ill-posed nature and challenges with capturing ground truth for supervision. Large-scale disparity data generated from stereo photos and 3D videos is a promising source of supervision, however, such disparity data can only approximate the inverse ground truth depth up to an affine transformation. To more effectively learn from such pseudo-depth data, we propose to use a simple pair-wise ranking loss with a novel sampling strategy. Instead of randomly sampling point pairs, we guide the sampling to better characterize structure of important regions based on the low-level edge maps and high-level object instance masks. We show that the pair-wise ranking loss, combined with our structure-guided sampling strategies, can significantly improve the quality of depth map prediction. In addition, we introduce a new relative depth dataset of about 21K diverse high-resolution web stereo photos to enhance the generalization ability of our model. In experiments, we conduct cross-dataset evaluation on six benchmark datasets and show that our method consistently improves over the baselines, leading to superior quantitative and qualitative results.
机译:单一图像深度预测由于其不适当的性质以及捕获地面实物进行监督所面临的挑战,是一项具有挑战性的任务。从立体照片和3D视频生成的大规模视差数据是有希望的监管来源,但是,此类视差数据只能近似逆地面真实深度,直到进行仿射变换。为了更有效地从此类伪深度数据中学习,我们建议使用简单的成对排名损失和新颖的采样策略。代替随机采样点对,我们指导采样以基于低级边缘图和高级对象实例蒙版更好地表征重要区域的结构。我们表明,成对排名损失与我们的结构指导采样策略相结合,可以显着提高深度图预测的质量。此外,我们引入了约21K张各种高分辨率网络立体照片的新的相对深度数据集,以增强模型的泛化能力。在实验中,我们对六个基准数据集进行了跨数据集评估,结果表明,我们的方法在基线之上持续改进,从而带来了出色的定量和定性结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号