IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching

Abstract

Deep multi-view stereo (MVS) and stereo matching approaches generally construct 3D cost volumes to regularize and regress the output depth or disparity. These methods are limited when high-resolution outputs are needed, since the memory and time costs grow cubically as the volume resolution increases. In this paper, we propose a memory- and time-efficient cost volume formulation that is complementary to existing multi-view stereo and stereo matching approaches based on 3D cost volumes. First, the proposed cost volume is built upon a standard feature pyramid encoding geometry and context at gradually finer scales. Then, we narrow the depth (or disparity) range of each stage using the depth (or disparity) map from the previous stage. With gradually higher cost volume resolution and adaptive adjustment of depth (or disparity) intervals, the output is recovered in a coarse-to-fine manner. We apply the cascade cost volume to the representative MVSNet and obtain a 35.6% improvement on the DTU benchmark (1st place), with 50.6% and 59.3% reductions in GPU memory and run-time. It is also the state-of-the-art learning-based method on the Tanks and Temples benchmark. Statistics of accuracy, run-time, and GPU memory on other representative stereo CNNs also validate the effectiveness of our proposed method. Our source code is available at https://github.com/alibaba/cascade-stereo.
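The core idea of the abstract — narrowing each stage's depth (or disparity) search range around the previous, coarser stage's estimate — can be sketched as follows. This is an illustrative NumPy sketch, not the authors' implementation; the function name, the example depth bounds, and the plane counts are assumptions chosen for demonstration.

```python
import numpy as np

def cascade_depth_hypotheses(prev_depth, num_hypotheses, interval):
    """Build per-pixel depth hypotheses for the next cascade stage.

    prev_depth: (H, W) depth map upsampled from the previous, coarser stage.
    num_hypotheses: number of depth planes sampled at this stage.
    interval: spacing between adjacent planes (shrinks at finer stages).

    Returns a (num_hypotheses, H, W) array of candidate depths centred on
    the previous stage's estimate, i.e. a narrowed per-pixel search range.
    """
    offsets = (np.arange(num_hypotheses) - num_hypotheses / 2) * interval
    return prev_depth[None, :, :] + offsets[:, None, None]

# Coarse stage: uniform sampling over the full scene depth range
# (illustrative bounds and plane count).
full_range = np.linspace(425.0, 935.0, 48)

# Pretend the coarse stage estimated ~600 depth units everywhere.
coarse_depth = np.full((4, 4), 600.0)

# Fine stage: fewer, tightly spaced planes clustered around that estimate,
# so the cost volume covers a much smaller depth range per pixel.
fine_hyps = cascade_depth_hypotheses(coarse_depth, 16, 2.0)
print(fine_hyps.shape)                 # (16, 4, 4)
print(np.ptp(fine_hyps[:, 0, 0]))      # searched range shrinks to 30.0 units
```

Because the finer stages sample only a narrow band around the previous estimate, the number of depth planes (and hence cost volume memory) stays small even as the spatial resolution grows, which is the source of the reported memory and run-time savings.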
