首页> 外国专利> METHODS AND APPARATUS FOR AUDIO-VISUAL SPEAKER RECOGNITION AND UTTERANCE VERIFICATION

METHODS AND APPARATUS FOR AUDIO-VISUAL SPEAKER RECOGNITION AND UTTERANCE VERIFICATION

机译:视听说话人识别和话音验证的方法和装置

摘要

PURPOSE: To provide a method and a device for improving a speaker recognition rate even on acoustic adverse conditions by using visual information together with corresponding audio information during a recognition process. CONSTITUTION: Concerning signals from a video compression source 2, video/ audio data are respectively passed through expandors 10 and 12 by a demultiplexer 8, video goes from a speaker face subdivision module 20 to a visual speaking feature extractor 22 and audio directly goes to an audio feature extractor 14. Data can be exchanged directly from a camera 4 or microphone 6 to the respective extractors as well. The video (audio) data are passed from a face recognition module 24 (audio speaker recognition module 16) through a reliability estimation block 26 (18) and determines a speaker while using score coupling in an identification/confirmation coupling module 30. Besides, a module 32 makes final determination from the output of a speaking confirmation module 26 based on the inputs of the extractors 22 and 14. In this case, the other technique such as feature coupling or re-scoring can be used for the determining method as well.
机译:目的:提供一种方法和设备,即使在声学不利条件下,也可以通过在识别过程中使用视觉信息和相应的音频信息来提高说话者的识别率。组成:关于来自视频压缩源2的信号,视频/音频数据分别由多路分解器8传递通过扩展器10和12,视频从扬声器面部细分模块20到达视觉说话特征提取器22,音频直接进入扬声器。音频特征提取器14。数据也可以直接从相机4或麦克风6交换到各个提取器。视频(音频)数据从面部识别模块24(音频说话者识别模块16)通过可靠性估计块26(18),并在使用识别/确认耦合模块30中的得分耦合时确定说话者。模块32基于提取器22和14的输入从语音确认模块26的输出做出最终确定。在这种情况下,诸如特征耦合或重新评分的其他技术也可以用于确定方法。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号