【24h】

Unsupervised Learning of Face Representations

机译:人脸表示的无监督学习

获取原文
获取原文并翻译 | 示例

摘要

We present an approach for unsupervised training of CNNs in order to learn discriminative face representations. We mine supervised training data by noting that multiple faces in the same video frame must belong to different persons and the same face tracked across multiple frames must belong to the same person. We obtain millions of face pairs from hundreds of videos without using any manual supervision. Although faces extracted from videos have a lower spatial resolution than those which are available as part of standard supervised face datasets such as LFW and CASIA-WebFace, the former represent a much more realistic setting, e.g. in surveillance scenarios where most of the faces detected are very small. We train our CNNs with the relatively low resolution faces extracted from video frames collected, and achieve a higher verification accuracy on the benchmark LFW dataset cf. hand-crafted features such as LBPs, and even surpasses the performance of state-of-the-art deep networks such as VGG-Face, when they are made to work with low resolution input images.
机译:我们提出了一种无监督的CNN训练方法,以学习区分性的面部表情。我们通过注意同一视频帧中的多个面部必须属于不同的人,并且在多个帧中跟踪的同一面部必须属于同一人来挖掘监督的训练数据。我们无需人工监督即可从数百个视频中获得数百万个面部对。尽管从视频中提取的人脸的空间分辨率比标准监督人脸数据集(例如LFW和CASIA-WebFace)中可获取的人的空间分辨率低,但前者代表了更为现实的设置,例如在大多数检测到的人脸很小的监视场景中。我们使用从采集的视频帧中提取的相对较低分辨率的面部训练我们的CNN,并在基准LFW数据集cf上实现更高的验证准确性。当使它们与低分辨率输入图像配合使用时,它们是手工制作的功能(例如LBP),甚至超过了最新的深度网络(例如VGG-Face)的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号