首页> 外文会议>ACM SIGMM workshop on Biometrics methods and applications >Person identification using automatic integration of speech, lip, and face experts
【24h】

Person identification using automatic integration of speech, lip, and face experts

机译:使用语音,唇缘和面部专家的自动集成人员识别

获取原文

摘要

This paper presents a multi-expert person identification system based on the integration of three separate systems employing audio features, static face images and lip motion features respectively. Audio person identification was carried out using a text dependent Hidden Markov Model methodology. Modeling of the lip motion was carried out using Gaussian probability density functions. The static image based identification was carried out using the FaceIt system. Experiments were conducted with 251 subjects from the XM2VTS audio-visual database. Late integration using automatic weights was employed to combine the three experts. The integration strategy adapts automatically to the audio noise conditions. It was found that the integration of the three experts improved the person identification accuracies for both clean and noisy audio conditions compared with the audio only case. For audio, FaceIt, lip motion, and tri-expert identification, maximum accuracies achieved were 98%, 93.22%, 86.37% and 100%respectively. Maximum bi-expert integration of the two visual experts achieved an identification accuracy of 96.8% which is comparable to the best audio accuracy of 98%.
机译:本文介绍了一种多专家人识别系统,基于集成三个采用音频特征,静态图像和唇部运动特征的单独系统。使用文本依赖隐藏的马尔可夫模型方法进行音频人员识别。使用高斯概率密度函数进行唇部运动的建模。基于静态图像的识别使用面部系统进行。使用来自XM2VTS音频视觉数据库的251个受试者进行实验。采用自动重量的延迟整合将三位专家组合起来。集成策略自动适应音频噪声条件。有人发现,与音频唯一的情况相比,三位专家的整合改善了清洁和嘈杂的音频条件的识别准确性。对于音频,面部面,唇部运动和三级专家鉴定,实现的最大精度分别为98%,93.22%,86.37%和100%。两台视觉专家的最高双专家集成实现了96.8%的识别精度,与最佳音频精度相比为98%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号