首页> 外文会议>IEEE International Conference on Computer Vision >Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos
【24h】

Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos

机译:未经标记视频中面部识别的无监督域适应

获取原文

摘要

Despite rapid advances in face recognition, there remains a clear gap between the performance of still image-based face recognition and video-based face recognition, due to the vast difference in visual quality between the domains and the difficulty of curating diverse large-scale video datasets. This paper addresses both of those challenges, through an image to video feature-level domain adaptation approach, to learn discriminative video frame representations. The framework utilizes large-scale unlabeled video data to reduce the gap between different domains while transferring discriminative knowledge from large-scale labeled still images. Given a face recognition network that is pretrained in the image domain, the adaptation is achieved by (i) distilling knowledge from the network to a video adaptation network through feature matching, (ii) performing feature restoration through synthetic data augmentation and (iii) learning a domain-invariant feature through a domain adversarial discriminator. We further improve performance through a discriminator-guided feature fusion that boosts high-quality frames while eliminating those degraded by video domain-specific factors. Experiments on the YouTube Faces and IJB-A datasets demonstrate that each module contributes to our feature-level domain adaptation framework and substantially improves video face recognition performance to achieve state-of-the-art accuracy. We demonstrate qualitatively that the network learns to suppress diverse artifacts in videos such as pose, illumination or occlusion without being explicitly trained for them.
机译:尽管对面部识别进行了快速进步,但由于域之间的视觉质量的巨大差异,仍然存在仍然存在明显的差距,并且域之间的视觉质量差异以及巩固各种大规模视频的难度数据集。本文通过图像到视频特征级域适应方法来解决这些挑战,以学习鉴别的视频帧表示。该框架利用大规模未标记的视频数据来减少不同域之间的间隙,同时从大规模标记的静止图像转移鉴别知识。给定在图像域中预先预留的面部识别网络,通过(i)通过特征匹配,(ii)通过合成数据增强和(iii)学习执行特征恢复的特征匹配来实现从网络的蒸馏到视频自适应网络来实现自适应通过域对抗鉴别器的域不变功能。我们通过鉴别者引导的特征融合进一步提高了性能,可以提高高质量帧,同时消除由视频域特定因素降级的人。 YouTube Faces和IJB-A数据集上的实验表明,每个模块都有助于我们的特征级域适应框架,并大大提高了视频面识别性能,以实现最先进的准确性。我们在定性上展示了网络学习在诸如姿势,照明或遮挡等视频中抑制不同的伪像,而不会为它们明确培训。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号