首页> 外文OA文献 >Cascaded collaborative regression for robust facial landmark detection trained using a mixture of synthetic and real images with dynamic weighting
【2h】

Cascaded collaborative regression for robust facial landmark detection trained using a mixture of synthetic and real images with dynamic weighting

机译:级联协作回归技术,通过混合使用动态加权的合成图像和真实图像训练鲁棒的面部标志

摘要

A large amount of training data is usually crucial for successful supervised learning. However, the task of providing training samples is often time-consuming, involving a considerable amount of tedious manual work. Also the amount of training data available is often limited. As an alternative, in this paper, we discuss how best to augment the available data for the application of automatic facial landmark detection (FLD). We propose the use of a 3D morphable face model to generate synthesised faces for a regression-based detector training. Benefiting from the large synthetic training data, the learned detector is shown to exhibit a better capability to detect the landmarks of a face with pose variations. Furthermore, the synthesised training dataset provides accurate and consistent landmarks as compared to using manual landmarks, especially for occluded facial parts. The synthetic data and real data are from different domains; hence the detector trained using only synthesised faces does not generalise well to real faces. To deal with this problem, we propose a cascaded collaborative regression (CCR) algorithm, which generates a cascaded shape updater that has the ability to overcome the difficulties caused by pose variations, as well as achieving better accuracy when applied to real faces. The training is based on a mix of synthetic and real image data with the mixing controlled by a dynamic mixture weighting schedule. Initially the training uses heavily the synthetic data, as this can model the gross variations between the various poses. As the training proceeds, progressively more of the natural images are incorporated, as these can model finer detail. To improve the performance of the proposed algorithm further, we designed a dynamic multi-scale local feature extraction method, which captures more informative local features for detector training. An extensive evaluation on both controlled and uncontrolled face datasets demonstrates the merit of the proposed algorithm.
机译:大量的培训数据通常对于成功的监督学习至关重要。但是,提供训练样本的任务通常很耗时,涉及大量繁琐的手工工作。而且,可用的训练数据量通常受到限制。作为替代方案,在本文中,我们将讨论如何最好地增加可用数据,以用于自动面部标志检测(FLD)。我们建议使用3D可变形人脸模型为基于回归的探测器训练生成合成人脸。受益于大量的综合训练数据,学习型检测器表现出更好的检测姿势变化的面部特征的能力。此外,与使用人工界标相比,合成的训练数据集提供准确且一致的界标,尤其是对于遮挡的面部部位。合成数据和真实数据来自不同的领域。因此,仅使用合成人脸训练的检测器不能很好地推广到真实人脸。为解决此问题,我们提出了一种级联协作回归(CCR)算法,该算法生成一个级联形状更新器,该级联形状更新器有能力克服由姿势变化引起的困难,并且在应用于真实面孔时可以获得更高的准确性。训练基于合成和真实图像数据的混合,混合由动态混合物权重计划控制。最初,训练会大量使用合成数据,因为这可以为各种姿势之间的总体变化建模。随着训练的进行,将逐渐合并更多的自然图像,因为这些自然图像可以建模更精细的细节。为了进一步提高所提出算法的性能,我们设计了一种动态的多尺度局部特征提取方法,该方法可以捕获更多信息丰富的局部特征以进行检测器训练。对受控和非受控面部数据集的广泛评估证明了所提出算法的优点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号