首页> 外文期刊>Image and Vision Computing >Composite recurrent network with internal denoising for facial alignment in still and video images in the wild
【24h】

Composite recurrent network with internal denoising for facial alignment in still and video images in the wild

机译:复合经常性网络与野外静止和视频图像中的面部对准的内部去噪

获取原文
获取原文并翻译 | 示例
           

摘要

Facial alignment is an essential task for many higher level facial analysis applications, such as animation, human activity recognition and human -computer interaction. Although the recent availability of big datasets and powerful deep-learning approaches have enabled major improvements on the state of the art accuracy, the performance of current approaches can severely deteriorate when dealing with images in highly unconstrained conditions, which limits the real-life applicability of such models. In this paper, we propose a composite recurrent tracker with internal denoising that jointly address both single image facial alignment and deformable facial tracking in the wild. Specifically, we incorporate multilayer LSTMs to model temporal dependencies with variable length and introduce an internal denoiser which selectively enhances the input images to improve the robustness of our overall model. We achieve this by combining 4 different sub-networks that specialize in each of the key tasks that are required, namely face detection, bounding-box tracking, facial region validation and facial alignment with internal denoising. These blocks are endowed with novel algorithms resulting in a facial tracker that is both accurate, robust to in-the-wild settings and resilient against drifting. We demonstrate this by testing our model on 300-W and Menpo datasets for single image facial alignment, and 300-VW dataset for deformable facial tracking. Comparison against 20 other state of the art methods demonstrates the excellent performance of the proposed approach. (c) 2021 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http:// creativecommons.org/licenses/by/4.0/).
机译:面部对齐是许多更高级别的面部分析应用的重要任务,例如动画,人类活动识别和人机交互。虽然最近的大数据集和强大的深度学习方法已经启用了最先进的最新性能的主要改进,但是当在高度无规约条件下处理图像时,当前方法的性能可能会严重恶化,这限制了现实生活的适用性这些模型。在本文中,我们提出了一种具有内部去噪的复合复制跟踪器,其共同地解决了野外的单个图像面部对准和可变形的面部跟踪。具体而言,我们将多层LSTM纳入模拟具有可变长度的时间依赖性,并引入内部置位器,其选择性地增强输入图像以提高我们整体模型的鲁棒性。我们通过组合4种不同的子网来实现这一目标,该子网专门从事所需的每个关键任务,即面部检测,边界箱跟踪,面部区域验证和内部去噪的面部校准。这些块具有新颖的算法,导致面部跟踪器,其既准确,对野外设置又鲁棒,并弹性逆转。我们通过在300-W和MENPO数据集中测试单个图像面部对准的模型和可变形面部跟踪的300 VW数据集来证明这一点。对20个其他最先进方法的比较显示了所提出的方法的优异性能。 (c)2021提交人。由elsevier b.v发布。这是CC下的开放式访问文章,由许可证(http:// creativecommons.org/licenses/by/4.0/)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号