【24h】

Recurrent network-based face recognition using image sequences

机译:使用图像序列的基于网络的循环人脸识别

获取原文

摘要

In this work, we propose a novel method for face recognition with large pose variations in image sequences using a cellular simultaneous recurrent network (CSRN).The pose problem is still a daunting challenge in face recognition. If the image sequences are obtained from different viewpoints in a surveillance type of application, the face recognition rate drops significantly. We formulate the recognition problem for face image sequences with large pose variation as an implicit temporal prediction task for CSRN. Further, to reduce the computational cost, we obtain eigenfaces for a set of image sequences for each person and use these reduced pattern vectors as the input to the CSRN. The CSRN is trained by this pattern vector, and each CSRN learns how to associate each face class/person in the training phase. When a new face is encountered, the corresponding image sequence is projected to each eigenface space to obtain the test pattern vectors. The Euclidian distances between successive frames of test and output pattern vectors indicate either a match or mismatch between the two corresponding face classes. We extensively evaluate our CSRN-based face recognition technique with 5 persons using publicly available VidTIMIT Audio-Video face dataset .In order to verify the performance of the CSRN, we also implement an Elman neural network for comparison. Our simulation shows that for this VidTIMIT Audio-Video face dataset with large pose variation, we can obtain an overall 65% (for rank 1) or 75% (for rank 2) face recognition accuracy better than the 55%(rank 1) recognition accuracy of Elman neural network.
机译:在这项工作中,我们提出了一种使用细胞同时递归网络(CSRN)的图像序列中姿态变化较大的人脸识别新方法。姿势问题仍然是人脸识别中的艰巨挑战。如果在监视类型的应用程序中从不同的角度获得图像序列,则面部识别率会大大下降。我们将具有较大姿态变化的人脸图像序列的识别问题公式化为CSRN的隐式时间预测任务。此外,为了减少计算成本,我们为每个人获取一组图像序列的特征脸,并将这些简化的模式向量用作CSRN的输入。通过该模式向量对CSRN进行训练,并且每个CSRN都将学习如何在训练阶段将每个面部类别/人物关联起来。当遇到新的面部时,将对应的图像序列投影到每个本征面部空间以获得测试图案矢量。测试和输出模式向量的连续帧之间的欧几里得距离表示两个对应的面部类别之间的匹配或不匹配。我们使用公开可用的VidTIMIT音频-视频人脸数据集对5名基于CSRN的人脸识别技术进行了广泛评估。为了验证CSRN的性能,我们还实施了Elman神经网络进行比较。我们的仿真显示,对于具有较大姿势变化的VidTIMIT音频-视频人脸数据集,我们获得的总体面部识别准确度要比55%(等级1)好,达到65%(对于等级1)或75%(对于等级2)。 Elman神经网络的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号