Person identification using automatic integration of speech, lip, and face experts


Abstract

This paper presents a multi-expert person identification system that integrates three separate systems employing audio features, static face images, and lip motion features, respectively. Audio-based person identification was carried out using a text-dependent hidden Markov model (HMM) methodology. Lip motion was modeled using Gaussian probability density functions. Static-image-based identification was carried out using the FaceIt system. Experiments were conducted with 251 subjects from the XM2VTS audio-visual database. Late integration with automatically determined weights was employed to combine the three experts, and the integration strategy adapts automatically to the audio noise conditions. Integrating the three experts improved person identification accuracy under both clean and noisy audio conditions compared with the audio-only case. The maximum accuracies achieved for audio, FaceIt, lip motion, and tri-expert identification were 98%, 93.22%, 86.37%, and 100%, respectively. The best bi-expert integration of the two visual experts achieved an identification accuracy of 96.8%, which is comparable to the best audio accuracy of 98%.
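The late integration described in the abstract can be illustrated with a minimal sketch of weighted score-level fusion. The abstract does not say how the automatic weights are computed, so everything below is an assumption made for illustration: the softmax normalization, the reliability measure (the gap between the best and second-best posterior), and the function names softmax, reliability, and fuse are hypothetical and are not taken from the paper.

```python
import math

def softmax(scores):
    """Map raw expert scores to a probability-like distribution over subjects."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def reliability(posteriors):
    """Assumed confidence measure: gap between the two largest posteriors.

    A noisy expert tends to produce a flat distribution, hence a small gap.
    """
    top, second = sorted(posteriors, reverse=True)[:2]
    return top - second

def fuse(expert_scores):
    """Late integration: weighted sum of per-expert posteriors.

    expert_scores maps an expert name to a list of raw scores,
    one score per enrolled subject.
    Returns (index of the identified subject, weights per expert).
    """
    posteriors = {name: softmax(s) for name, s in expert_scores.items()}
    rel = {name: reliability(p) for name, p in posteriors.items()}
    total = sum(rel.values())
    n_experts = len(posteriors)
    # Normalize reliabilities into weights; fall back to equal weights
    # if every expert is completely undecided.
    weights = {
        name: (r / total if total > 0 else 1.0 / n_experts)
        for name, r in rel.items()
    }
    n_subjects = len(next(iter(posteriors.values())))
    fused = [
        sum(weights[name] * posteriors[name][i] for name in posteriors)
        for i in range(n_subjects)
    ]
    best = max(range(n_subjects), key=fused.__getitem__)
    return best, weights

# Made-up scores for three enrolled subjects (subject 0 is the true identity).
clean = {"audio": [8.0, 1.0, 0.5], "face": [2.0, 1.5, 1.0], "lip": [1.2, 1.0, 0.9]}
noisy = {"audio": [1.0, 1.1, 0.9], "face": [2.0, 1.5, 1.0], "lip": [1.2, 1.0, 0.9]}

best_clean, w_clean = fuse(clean)
best_noisy, w_noisy = fuse(noisy)
```

With these made-up scores, the audio expert dominates the fusion when its scores are sharply peaked, and its weight falls below that of the face expert when its scores flatten out, which is the kind of adaptation to audio noise the abstract refers to. The scores are illustrative only and are not results from the paper.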


Bibliographic details

Source
Proceedings of the 2003 ACM SIGMM workshop on Biometrics methods and applications, 2003, pp. 25-32 (8 pages)
Conference location: Berkeley, CA (US)
Authors
Niall A. Fox; Ralph Gross; Philip de Chazal; Jeffery F. Cohn; Richard B. Reilly
Author affiliations

University College Dublin, Dublin, Ireland;

Carnegie Mellon University, Pittsburgh, PA;

Carnegie Mellon University, Pittsburgh, PA;

Original format: PDF
Language of text: English
Chinese Library Classification: computing technology, computer technology
Keywords
person identification


Similar literature

1. Robust Biometric Person Identification Using Automatic Classifier Fusion of Speech, Mouth, and Face Experts [J]. Fox N.A., Gross R., Cohn J.F., IEEE Transactions on Multimedia. 2007, no. 4
2. Evaluation of speech intelligibility for children with cleft lip and palate by means of automatic speech recognition [J]. Schuster M, Maier A, Haderlein T, International Journal of Pediatric Otorhinolaryngology. 2006, no. 10
3. Lip Detection and Lip Geometric Feature Extraction using Constrained Local Model for Spoken Language Identification using Visual Speech Recognition [J]. Aparna Brahme, Umesh Bhadade, Indian Journal of Science and Technology. 2016, no. 32
4. Person identification using automatic integration of speech, lip, and face experts [C]. Niall A. Fox, Ralph Gross, Philip de Chazal, ACM SIGMM workshop on Biometrics methods and applications. 2003
5. Automatic speech code identification with application to tampering detection of speech recordings [D]. Zhou, Jingting. 2011
6. Music expertise shapes audiovisual temporal integration windows for speech, sinewave speech, and music [O]. Hweeling Lee, Uta Noppeney. 2014
7. Person Identification Using Automatic Integration of Speech, Lip, and Face Experts [O]. Niall Fox, Jeffery F. Cohn, Ralph Gross. 2003
