Person identification using automatic integration of speech, lip, and face experts


Abstract

This paper presents a multi-expert person identification system based on the integration of three separate systems employing audio features, static face images, and lip motion features, respectively. Audio person identification was carried out using a text-dependent Hidden Markov Model methodology. Modeling of the lip motion was carried out using Gaussian probability density functions. The static-image-based identification was carried out using the FaceIt system. Experiments were conducted with 251 subjects from the XM2VTS audio-visual database. Late integration using automatic weights was employed to combine the three experts. The integration strategy adapts automatically to the audio noise conditions. It was found that the integration of the three experts improved the person identification accuracies for both clean and noisy audio conditions compared with the audio-only case. For audio, FaceIt, lip motion, and tri-expert identification, the maximum accuracies achieved were 98%, 93.22%, 86.37%, and 100%, respectively. Bi-expert integration of the two visual experts achieved a maximum identification accuracy of 96.8%, which is comparable to the best audio accuracy of 98%.
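The abstract does not say how the automatic weights are computed, so the following is only a minimal sketch of weighted late integration under stated assumptions. It assumes each expert outputs one log-likelihood per enrolled subject, and it uses a hypothetical reliability measure (the gap between the two largest posteriors) to set the weights. The function names `posteriors`, `reliability`, and `fuse`, and all scores in the example, are illustrative and are not taken from the paper.

```python
import math


def posteriors(log_likelihoods):
    """Convert one expert's per-subject log-likelihoods to posteriors (softmax)."""
    peak = max(log_likelihoods)
    exps = [math.exp(v - peak) for v in log_likelihoods]  # subtract peak for stability
    total = sum(exps)
    return [e / total for e in exps]


def reliability(log_likelihoods):
    """Assumed confidence measure: gap between the two largest posteriors.

    A nearly flat score profile (e.g. noisy audio) gives a gap close to 0.
    """
    top = sorted(posteriors(log_likelihoods), reverse=True)
    return top[0] - top[1]


def fuse(expert_scores):
    """Weighted-sum late fusion over experts.

    expert_scores maps an expert name to a list of log-likelihoods,
    one per enrolled subject (at least two subjects).
    Returns (index of the identified subject, weights per expert).
    """
    rel = {name: reliability(s) for name, s in expert_scores.items()}
    total = sum(rel.values())
    n_experts = len(expert_scores)
    # Fall back to equal weights if no expert shows any confidence.
    weights = {
        name: (r / total if total > 0 else 1.0 / n_experts)
        for name, r in rel.items()
    }
    n_subjects = len(next(iter(expert_scores.values())))
    fused = [0.0] * n_subjects
    for name, s in expert_scores.items():
        for i, p in enumerate(posteriors(s)):
            fused[i] += weights[name] * p
    best = max(range(n_subjects), key=fused.__getitem__)
    return best, weights


# Illustrative scores for three enrolled subjects (made-up numbers).
scores = {
    "audio": [-10.0, -10.1, -10.2],  # nearly flat, as under heavy audio noise
    "face": [-12.0, -5.0, -11.0],
    "lip": [-8.0, -6.0, -6.5],
}
best, weights = fuse(scores)
```

In this example the audio expert weakly prefers subject 0, but its flat score profile earns it a small weight, so the fused decision follows the two visual experts. This mirrors the behaviour described above, where the integration adapts to audio noise, without claiming to reproduce the authors' weighting scheme.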


Bibliographic details

Source: ACM SIGMM workshop on Biometrics methods and applications, 2003, 8 pages
Authors: Niall A. Fox; Ralph Gross; Philip de Chazal; Jeffery F. Cohn; Richard B. Reilly
Original format: PDF
Chinese Library Classification: Information processing
Keywords: person identification

