...
首页> 外文期刊>Computers & Graphics >Adaptive depth estimation for pyramid multi-view stereo
【24h】

Adaptive depth estimation for pyramid multi-view stereo

机译:金字塔多视图立体声的自适应深度估计

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we propose a Multi-View Stereo (MVS) network which can perform efficient high-resolution depth estimation with low memory consumption. Classical learning-based MVS approaches typically construct 3D cost volumes to regress depth information, making the output resolution rather limited as the memory consumption grows cubically with the input resolution. Although recent approaches have made significant progress in scalability by introducing the coarse-to-fine fashion or sequential cost map regularization, the memory consumption still grows quadratically with input resolution and is not friendly for commodity GPU. Observing that the surfaces of most objects in real world are locally smooth, we assume that most of the depth hypotheses upsampled from a well-estimated depth map are accurate. Based on the assumption, we propose a pyramid MVS network based on the adaptive depth estimation, which gradually refines and upsamples the depth map to the desired resolution. Instead of estimating depth hypotheses for all pixels in the depth map, our method only performs prediction at adaptively selected locations, alleviating excessive computation on well-estimated positions. To estimate depth hypotheses for sparse selected locations, we propose the lightweight pixelwise depth estimation network, which can estimate depth value for each selected location independently. Experiments demonstrate that our method can generate results comparable with the state-of-the-art learning-based methods while reconstructing more geometric details and consuming less GPU memory.(c) 2021 Elsevier Ltd. All rights reserved.
机译:在本文中,我们提出了一种多视图立体声(MVS)网络,其可以使用低存储器消耗执行有效的高分辨率深度估计。基于古典学习的MVS方法通常构建3D成本卷来回归深度信息,使输出分辨率相当受到限制,因为存储器消耗随输入分辨率的差异而增加。尽管最近的方法通过引入粗细的时尚或顺序成本图正常化来实现可扩展性取得了重大进展,但是内存消耗仍然具有二次输入分辨率,并且对于商品GPU并不友好。观察到现实世界中大多数物体的表面是局部光滑的,我们假设从估计估计的深度图上采样的大多数深度假设都是准确的。基于该假设,我们提出了一种基于自适应深度估计的金字塔MVS网络,该网络逐渐将深度图逐渐改进并将深度映射提升到所需的分辨率。代替估计深度图中所有像素的深度假设,我们的方法仅在自适应选择的位置执行预测,减轻了对估计估计位置的过度计算。为了估计稀疏所选位置的深度假设,我们提出了轻量级像素深度估计网络,其可以独立地估计每个所选位置的深度值。实验表明,我们的方法可以生成与最先进的基于学习的方法相当的结果,同时重建更多的几何细节并消耗较少的GPU内存。(c)2021 elestvier有限公司保留所有权利。

著录项

  • 来源
    《Computers & Graphics》 |2021年第6期|268-278|共11页
  • 作者单位

    Wuhan Univ Sch Comp Sci Wuhan Peoples R China;

    Anhui Univ Sch Comp Sci & Technol Hefei Peoples R China;

    JD Com Silicon Valley Res Ctr Multimedia Software Beijing Peoples R China;

    Wuhan Univ Sch Comp Sci Wuhan Peoples R China;

    Wuhan Univ Sch Comp Sci Wuhan Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    3D Reconstruction; Multi-View Stereo; Deep Learning;

    机译:3D重建;多视图立体声;深入学习;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号