首页> 外文会议>Conference on Computer and Robot Vision >Robust Facial Alignment with Internal Denoising Auto-Encoder
【24h】

Robust Facial Alignment with Internal Denoising Auto-Encoder

机译:带有内部去噪自动编码器的稳固面部对准

获取原文

摘要

The development of facial alignment models is growing rapidly thanks to the availability of large facial landmarked datasets and powerful deep learning models. However, important challenges still remain for facial alignment models to work on images under extreme conditions, such as severe occlusions or large variations in pose and illumination. Current attempts to overcome this limitation have mainly focused on building robust feature extractors with the assumption that the model will be able to discard the noise and select only the meaningful features. However, such an assumption ignores the importance of understanding the noise that characterizes unconstrained images, which has been shown to benefit computer vision models if used appropriately on the learning strategy. Thus, in this paper we investigate the introduction of specialized modules for noise detection and removal, in combination with our state-of-the-art facial alignment module and show that this leads to improved robustness both to synthesized noise and in-the-wild conditions. The proposed model is built by combining two major subnetworks: internal image denoiser (based on the Auto-Encoder architecture) and facial landmark localiser (based on the inception-resnet architecture). Our results on the 300-W and Menpo datasets show that our model can effectively handle different types of synthetic noise, which also leads to enhanced robustness in real-world unconstrained settings, reaching top state-of-the-art accuracy.
机译:面部对齐模型的开发正迅速发展,这要归功于大型面部标志性数据集和强大的深度学习模型的可用性。但是,脸部对准模型在极端条件下(例如,严重的遮挡或姿势和照明的大范围变化)下的图像上工作仍然面临着重要的挑战。当前克服该限制的尝试主要集中在构建健壮的特征提取器上,并假设该模型将能够丢弃噪声并仅选择有意义的特征。但是,这样的假设忽略了理解表征无约束图像的噪声的重要性,如果在学习策略上正确使用噪声,这已被证明有益于计算机视觉模型。因此,在本文中,我们结合最新的面部对准模块,研究了用于噪声检测和消除的专用模块的引入,并表明这可以提高对合成噪声和野外噪声的鲁棒性情况。提出的模型是通过结合两个主要子网构建的:内部图像去噪器(基于自动编码器体系结构)和面部界标定位器(基于初始-resnet体系结构)。我们在300瓦和Menpo数据集上的结果表明,我们的模型可以有效地处理不同类型的合成噪声,这还可以在现实世界中不受约束的设置中增强鲁棒性,达到最高的最新精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号