首页> 外文会议>12th IEEE International Conference on Automatic Face and Gesture Recognition >Learning Local Responses of Facial Landmarks with Conditional Variational Auto-Encoder for Face Alignment
【24h】

Learning Local Responses of Facial Landmarks with Conditional Variational Auto-Encoder for Face Alignment

机译:使用条件变分自动编码器对人脸对齐学习面部地标的局部响应

获取原文
获取原文并翻译 | 示例

摘要

This work proposes a novel convolutional neural network architecture which can locate landmarks accurately by learning local responses of facial landmarks. The network consists of a Conditional Variational Auto-Encoder(CVAE) and a Deep Convolutional Neural Network(DCNN). The CVAE is used to learn the response maps of facial landmarks from face images and the DCNN is used to learn accurate landmark locations from the response maps and facial textures. The CVAE consists of a face encoder, which extracts high-level information from raw pixels, and a decoder which outputs local response maps from high-level coding. We derive the CVAE used for catching local responses as an optimization problem, which can be solved through back-propagation. Extensive experiments show that the proposed CVAE can learn better local response maps than Fully Convolutional Network(FCN). Our method outperforms state-of-the-art methods on AFLW(5 points) and the challenging subset of 300-W(68 points), which means our method shows advantages in the condition of complex poses and expressions.
机译:这项工作提出了一种新颖的卷积神经网络架构,该架构可以通过学习面部地标的局部响应来准确地定位地标。该网络由条件变分自动编码器(CVAE)和深度卷积神经网络(DCNN)组成。 CVAE用于从面部图像中学习面部地标的响应图,而DCNN用于从响应图和面部纹理中了解准确的地标位置。 CVAE包括一个从原始像素中提取高级信息的面部编码器,以及一个从高级编码中输出本地响应图的解码器。我们将用于捕获本地响应的CVAE作为优化问题导出,可以通过反向传播解决。大量实验表明,所提出的CVAE比全卷积网络(FCN)可以更好地学习局部响应图。我们的方法在AFLW(5分)和具有挑战性的300-W(68分)的子集上优于最新方法,这意味着我们的方法在复杂的姿势和表情条件下显示出优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号