Journal: Computer Vision and Image Understanding

Multimodal person authentication using speech, face and visual speech

Abstract

This paper presents a method for automatic multimodal person authentication using speech, face and visual speech modalities. The proposed method uses motion information to localize the face region, and the face region is processed in the YCrCb color space to determine the locations of the eyes. The system models the non-lip region of the face using a Gaussian distribution, and this model is used to estimate the center of the mouth. Facial and visual speech features are extracted using multiscale morphological erosion and dilation operations, respectively. The facial features are extracted relative to the locations of the eyes, and the visual speech features are extracted relative to the locations of the eyes and mouth. Acoustic features are derived from the speech signal and are represented by weighted linear prediction cepstral coefficients (WLPCC). Autoassociative neural network (AANN) models are used to capture the distribution of the extracted acoustic, facial and visual speech features. The evidence from the speech, face and visual speech models is combined using a weighting rule, and the result is used to accept or reject the identity claim of the subject. The performance of the system is evaluated for newsreaders in TV broadcast news data, and the system achieves an equal error rate (EER) of about 0.45% for 50 subjects.
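
The abstract does not give the exact weighting rule or how the EER was computed, so the following is only a minimal sketch of the general idea: per-modality confidence scores are combined by a weighted sum, the fused score is compared with a threshold to accept or reject the claim, and the EER is approximated as the point where the false acceptance and false rejection rates are closest. The function names, the weights (0.5, 0.3, 0.2), the threshold and the sample scores are all hypothetical and are not taken from the paper.

```python
from typing import Sequence, Tuple


def fuse_scores(speech: float, face: float, visual_speech: float,
                weights: Tuple[float, float, float] = (0.5, 0.3, 0.2)) -> float:
    """Weighted sum of three modality scores (weights are illustrative assumptions)."""
    total = sum(weights)
    w_speech, w_face, w_visual = (w / total for w in weights)  # normalize to sum to 1
    return w_speech * speech + w_face * face + w_visual * visual_speech


def accept_claim(fused_score: float, threshold: float = 0.5) -> bool:
    """Accept the identity claim when the fused score reaches the threshold."""
    return fused_score >= threshold


def equal_error_rate(genuine: Sequence[float], impostor: Sequence[float]) -> float:
    """Approximate the EER by sweeping the threshold over all observed scores.

    genuine:  fused scores for true identity claims
    impostor: fused scores for false identity claims
    """
    best_gap, eer = float("inf"), 1.0
    for t in sorted(set(genuine) | set(impostor)):
        frr = sum(s < t for s in genuine) / len(genuine)      # false rejection rate
        far = sum(s >= t for s in impostor) / len(impostor)   # false acceptance rate
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


if __name__ == "__main__":
    fused = fuse_scores(speech=0.9, face=0.8, visual_speech=0.7)
    print("accepted:", accept_claim(fused))
    print("EER:", equal_error_rate([0.9, 0.4], [0.6, 0.1]))
```

In a sketch like this, the weights and the threshold would normally be tuned on held-out data; a weighted sum is only one of several possible fusion rules.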