首页> 外文会议>Proceedings of the 2003 ACM SIGMM workshop on Biometrics methods and applications >Person identification using automatic integration of speech, lip, and face experts
【24h】

Person identification using automatic integration of speech, lip, and face experts

机译:使用语音,嘴唇和面部专家的自动集成进行人员识别

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a multi-expert person identification system based on the integration of three separate systems employing audio features, static face images and lip motion features respectively. Audio person identification was carried out using a text dependent Hidden Markov Model methodology. Modeling of the lip motion was carried out using Gaussian probability density functions. The static image based identification was carried out using the FaceIt system. Experiments were conducted with 251 subjects from the XM2VTS audio-visual database. Late integration using automatic weights was employed to combine the three experts. The integration strategy adapts automatically to the audio noise conditions. It was found that the integration of the three experts improved the person identification accuracies for both clean and noisy audio conditions compared with the audio only case. For audio, FaceIt, lip motion, and tri-expert identification, maximum accuracies achieved were 98%, 93.22%, 86.37% and 100%respectively. Maximum bi-expert integration of the two visual experts achieved an identification accuracy of 96.8% which is comparable to the best audio accuracy of 98%.
机译:本文提出了一个多专家身份识别系统,该系统基于三个分别使用音频功能,静态面部图像和嘴唇运动功能的独立系统的集成。使用文本相关的隐式马尔可夫模型方法进行音频人识别。使用高斯概率密度函数对嘴唇运动进行建模。使用FaceIt系统执行基于静态图像的识别。对来自XM2VTS视听数据库的251名受试者进行了实验。使用自动权重的后期集成来组合这三位专家。集成策略可自动适应音频噪声条件。结果发现,与仅使用音频的情况相比,三位专家的整合改善了干净和嘈杂音频条件下的人员识别准确性。对于音频,FaceIt,嘴唇动作和三专家识别,获得的最大准确性分别为98%,93.22%,86.37%和100%。两位视觉专家的最大双专家集成达到了96.8%的识别准确度,可与98%的最佳音频准确度相提并论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号