首页> 外文期刊>Computer vision and image understanding >Deep 3D morphable model refinement via progressive growing of conditional Generative Adversarial Networks
【24h】

Deep 3D morphable model refinement via progressive growing of conditional Generative Adversarial Networks

机译:通过逐步生成条件生成对抗网络,对3D变形模型进行深度优化

获取原文
获取原文并翻译 | 示例
           

摘要

3D face reconstruction from a single 2D image is a fundamental Computer Vision problem of extraordinary difficulty. Statistical modeling techniques, such as the 3D Morphable Model (3DMM), have been widely exploited because of their capability of reconstructing a plausible model grounding on the prior knowledge of the facial shape. However, most of these techniques derive an approximated and smooth reconstruction of the face, without accounting for fine-grained details. In this work, we propose an approach based on a Conditional Generative Adversarial Network (CGAN) for refining the coarse reconstruction provided by a 3DMM. The latter is represented as a three channels image, where the pixel intensities represent the depth, curvature and elevation values of the 3D vertices. The architecture is an encoder-decoder, which is trained progressively, starting from the lower-resolution layers; this technique allows a more stable training, which leads to the generation of high quality outputs even when high-resolution images are fed during the training. Experimental results show that our method is able to produce reconstructions with fine-grained realistic details and lower reconstruction errors with respect to the 3DMM. A cross-dataset evaluation also shows that the network retains good generalization capabilities. Finally, comparison with state-of-the-art solutions evidence competitive performance, with comparable or lower error in most of the cases, and a clear improvement in the quality of the generated models.
机译:从单个2D图像重建3D人脸是极其困难的基本计算机视觉问题。统计建模技术(例如3D Morphable Model(3DMM))已被广泛利用,因为它们具有基于面部形状的先验知识重建合理模型的能力。但是,大多数这些技术都可以在不考虑细化细节的情况下获得近似平滑的人脸重建效果。在这项工作中,我们提出了一种基于条件生成对抗网络(CGAN)的方法,用于完善3DMM提供的粗略重构。后者表示为三通道图像,其中像素强度表示3D顶点的深度,曲率和高程值。该体系结构是编码器/解码器,从较低分辨率的层开始逐步进行训练。此技术可实现更稳定的训练,即使在训练过程中输入了高分辨率图像,也可以生成高质量的输出。实验结果表明,相对于3DMM,我们的方法能够产生具有细粒度逼真的细节的重构,并降低了重构误差。跨数据集评估还表明,该网络保留了良好的泛化能力。最后,与最新解决方案的比较证明了竞争性能,在大多数情况下具有可比较的或更低的误差,并且明显改善了所生成模型的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号