High-Fidelity Monocular Face Reconstruction Based on an Unsupervised Model-Based Face Autoencoder

首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >High-Fidelity Monocular Face Reconstruction Based on an Unsupervised Model-Based Face Autoencoder

【24h】

High-Fidelity Monocular Face Reconstruction Based on an Unsupervised Model-Based Face Autoencoder

机译：基于无监督模型的人脸自编码器的高保真单眼人脸重构

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this work, we propose a novel model-based deep convolutional autoencoder that addresses the highly challenging problem of reconstructing a 3D human face from a single in-the-wild color image. To this end, we combine a convolutional encoder network with an expert-designed generative model that serves as decoder. The core innovation is the differentiable parametric decoder that encapsulates image formation analytically based on a generative model. Our decoder takes as input a code vector with exactly defined semantic meaning that encodes detailed face pose, shape, expression, skin reflectance, and scene illumination. Due to this new way of combining CNN-based with model-based face reconstruction, the CNN-based encoder learns to extract semantically meaningful parameters from a single monocular input image. For the first time, a CNN encoder and an expert-designed generative model can be trained end-to-end in an unsupervised manner, which renders training on very large (unlabeled) real world datasets feasible. The obtained reconstructions compare favorably to current state-of-the-art approaches in terms of quality and richness of representation. This work is an extended version of [1] , where we additionally present a stochastic vertex sampling technique for faster training of our networks, and moreover, we propose and evaluate analysis-by-synthesis and shape-from-shading refinement approaches to achieve a high-fidelity reconstruction.

机译：在这项工作中，我们提出了一种新颖的基于模型的深度卷积自动编码器，该编码器解决了从单个野生彩色图像中重建3D人脸的极具挑战性的问题。为此，我们将卷积编码器网络与专家设计的生成模型（用作解码器）相结合。核心创新是可微分参数解码器，该微分参数解码器基于生成模型分析性地封装了图像形成。我们的解码器将具有精确定义的语义含义的代码向量作为输入，对详细的面部姿势，形状，表情，皮肤反射率和场景照明进行编码。由于这种将基于CNN与基于模型的人脸重建相结合的新方式，基于CNN的编码器学会了从单个单眼输入图像中提取语义上有意义的参数。第一次，CNN编码器和专家设计的生成模型可以无监督的方式进行端到端的训练，这使得在非常大的（未标记）现实世界数据集上进行训练变得可行。就表示的质量和丰富性而言，所获得的重构与当前的最新技术相比具有优势。这项工作是[1]的扩展版本，在此我们另外提出了一种随机顶点采样技术，可以更快地训练我们的网络，此外，我们提出并评估了综合分析和阴影形状优化方法来实现高保真重建。

著录项

来源
《IEEE Transactions on Pattern Analysis and Machine Intelligence》 |2020年第2期|357-370|共14页
作者

展开▼
作者单位

Max Planck Inst Informat D-66123 Saarbrcken Germany;

Stanford Univ Stanford CA 94305 USA;

Technicolor F-92130 Issy Les Moulineaux France;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Face; Image reconstruction; Three-dimensional displays; Training; Decoding; Shape; Lighting;

机译：面对;影像重建;三维显示器;训练;解码;形状;灯光;

相似文献

外文文献
中文文献
专利

1. Model-based human gait tracking, 3D reconstruction and recognition in uncalibrated monocular video [J] . E Adeli-Mosabbeb, M Fathy, F Zargari The imaging science journal . 2012,第1期

机译：未校准的单眼视频中基于模型的步态跟踪，3D重建和识别
2. High-resolution CT with new model-based iterative reconstruction with resolution preference algorithm in evaluations of lung nodules: Comparison with conventional model-based iterative reconstruction and adaptive statistical iterative reconstruction [J] . Yasaka Koichiro, Katsura Masaki, Hanaoka Shouhei, European Journal of Radiology . 2016,第3期

机译：高分辨率CT与基于模型的基于迭代优先重构的新模型的迭代重建在肺结节评估中的应用：与基于传统模型的迭代重建和自适应统计迭代重建的比较
3. Pollen-based biome reconstructions for the past 450000 yr from the Funza-2 core, Colombia: Comparisons with model-based vegetation reconstructions [J] . Robert Marchant, Arnoud Boom, Henry Hooghiemstra Palaeogeography, Palaeoclimatology, Palaeoecology: An International Journal for the Geo-Sciences . 2002,第1a2期

机译：来自哥伦比亚Funza-2核心的过去450000年基于花粉的生物群落重建：与基于模型的植被重建的比较
4. MoFA: Model-Based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction [C] . Ayush Tewari, Michael Zollhöfer, Hyeongwoo Kim, IEEE International Conference on Computer Vision Workshops . 2017

机译：MoFA：用于无监督单眼重构的基于模型的深度卷积脸部自动编码器
5. Model-Based Hand Posture Estimation Using Monocular Camera. [D] . Zhang, Liwen. 2012

机译：使用单眼相机的基于模型的手势估计。
6. Unsupervised Monocular Depth Estimation Method Based on Uncertainty Analysis and Retinex Algorithm [O] . Chuanxue Song, Chunyang Qi, Shixin Song, 2020

机译：基于不确定性分析和Retinex算法的无监督单眼深度估计方法
7. {MoFA}: Model-based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction [O] . Tewari, A., Zollhöfer, M., Kim, H., 2017

机译：{moFa}：基于模型的深度卷积面自动编码器，用于无监督单眼重建
8. Three Dimensional Structure of Core-Shell Nanoparticle Assemblies: Electron Tomography and Model-Based Iterative Reconstructions (Preprint). [R] . Hsiao, M., Vaia, R., Drummy, L. F., 2017

机译：核壳纳米粒子组件的三维结构：电子层析成像和基于模型的迭代重建（预印本）。

High-Fidelity Monocular Face Reconstruction Based on an Unsupervised Model-Based Face Autoencoder

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅