首页> 外文会议>International Conference on Automatic Face and Gesture Recognition >Generalizing to Unseen Head Poses in Facial Expression Recognition and Action Unit Intensity Estimation
【24h】

Generalizing to Unseen Head Poses in Facial Expression Recognition and Action Unit Intensity Estimation

机译:在面部表情识别和动作单位强度估计中推广到看不见的头部姿势

获取原文
获取外文期刊封面目录资料

摘要

Facial expression analysis is challenged by the numerous degrees of freedom regarding head pose, identity, illumination, occlusions, and the expressions itself. It currently seems hardly possible to densely cover this enormous space with data for training a universal well-performing expression recognition system. In this paper we address the sub-challenge of generalizing to head poses that were not seen in the training data, aiming at getting along with sparse coverage of the pose subspace. For this purpose we (1) propose a novel face normalization method called FaNC that massively reduces pose-induced image variance; (2) we compare the impact of the proposed and other normalization methods on (a) action unit intensity estimation with the FERA 2017 challenge data (achieving new state of the art) and (b) facial expression recognition with the Multi-PIE dataset; and (3) we discuss the head pose distribution needed to train a pose-invariant CNN-based recognition system. The proposed FaNC method normalizes pose and facial proportions while retaining expression information and runs in less than 2 ms. When comparing results achieved by training a CNN on the output images of FaNC and other normalization methods, FaNC generalizes significantly better than others to unseen poses if they deviate more than 20° from the poses available during training. Code and data are available.
机译:面部表情分析受到有关头部姿势,身份,照明,遮挡以及表情本身的众多自由度的挑战。当前,似乎很难用训练通用的性能良好的表情识别系统的数据密集地覆盖这个巨大的空间。在本文中,我们解决了归纳为训练数据中未见的头部姿势的子挑战,目的是与姿势子空间的稀疏覆盖相处。为此,我们(1)提出了一种新颖的人脸归一化方法,称为FaNC,可大幅减少姿态引起的图像差异; (2)我们比较建议的和其他标准化方法对(a)使用FERA 2017挑战数据(达到最新技术水平)的动作单位强度估计和(b)使用Multi-PIE数据集的面部表情识别的影响; (3)我们讨论了训练基于姿势不变的CNN的识别系统所需的头部姿势分布。所提出的FaNC方法在保留表情信息的同时标准化姿势和面部比例,并且运行时间少于2毫秒。当比较通过在FaNC的输出图像上训练CNN和其他归一化方法获得的结果时,如果FaNC从训练过程中可用的姿势偏离了超过20°,则FanNC的泛化效果明显优于其他姿势。代码和数据可用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号