首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >High-Fidelity Monocular Face Reconstruction Based on an Unsupervised Model-Based Face Autoencoder
【24h】

High-Fidelity Monocular Face Reconstruction Based on an Unsupervised Model-Based Face Autoencoder

机译:基于无监督模型的人脸自编码器的高保真单眼人脸重构

获取原文
获取原文并翻译 | 示例
           

摘要

In this work, we propose a novel model-based deep convolutional autoencoder that addresses the highly challenging problem of reconstructing a 3D human face from a single in-the-wild color image. To this end, we combine a convolutional encoder network with an expert-designed generative model that serves as decoder. The core innovation is the differentiable parametric decoder that encapsulates image formation analytically based on a generative model. Our decoder takes as input a code vector with exactly defined semantic meaning that encodes detailed face pose, shape, expression, skin reflectance, and scene illumination. Due to this new way of combining CNN-based with model-based face reconstruction, the CNN-based encoder learns to extract semantically meaningful parameters from a single monocular input image. For the first time, a CNN encoder and an expert-designed generative model can be trained end-to-end in an unsupervised manner, which renders training on very large (unlabeled) real world datasets feasible. The obtained reconstructions compare favorably to current state-of-the-art approaches in terms of quality and richness of representation. This work is an extended version of [1] , where we additionally present a stochastic vertex sampling technique for faster training of our networks, and moreover, we propose and evaluate analysis-by-synthesis and shape-from-shading refinement approaches to achieve a high-fidelity reconstruction.
机译:在这项工作中,我们提出了一种新颖的基于模型的深度卷积自动编码器,该编码器解决了从单个野生彩色图像中重建3D人脸的极具挑战性的问题。为此,我们将卷积编码器网络与专家设计的生成模型(用作解码器)相结合。核心创新是可微分参数解码器,该微分参数解码器基于生成模型分析性地封装了图像形成。我们的解码器将具有精确定义的语义含义的代码向量作为输入,对详细的面部姿势,形状,表情,皮肤反射率和场景照明进行编码。由于这种将基于CNN与基于模型的人脸重建相结合的新方式,基于CNN的编码器学会了从单个单眼输入图像中提取语义上有意义的参数。第一次,CNN编码器和专家设计的生成模型可以无监督的方式进行端到端的训练,这使得在非常大的(未标记)现实世界数据集上进行训练变得可行。就表示的质量和丰富性而言,所获得的重构与当前的最新技术相比具有优势。这项工作是[1]的扩展版本,在此我们另外提出了一种随机顶点采样技术,可以更快地训练我们的网络,此外,我们提出并评估了综合分析和阴影形状优化方法来实现高保真重建。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号