首页> 外文会议>Pattern Recognition and Machine Intelligence >Audio Visual Speaker Verification Based on Hybrid Fusion of Cross Modal Features
【24h】

Audio Visual Speaker Verification Based on Hybrid Fusion of Cross Modal Features

机译:基于交叉模态特征混合融合的视听说话人验证

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose hybrid fusion of audio and explicit correlation features for speaker identity verification applications. Experiments were performed with the GMM based speaker models with a hybrid fusion technique involving late fusion of explicit cross-modal fusion features, with implicit eigen lip and audio MFCC features. An evaluation of the system performance with different gender specific datasets from controlled VidTIMIT data base and opportunistic UCBN database shows a significant performance improvement.
机译:在本文中,我们提出了音频和显式相关特征的混合融合,以用于说话人身份验证应用。实验是在基于GMM的扬声器模型上进行的,该模型采用了混合融合技术,其中包括显式交叉模态融合特征的后期融合,以及隐含的特征嘴唇和音频MFCC特征。对来自受控VidTIMIT数据库和机会性UCBN数据库的具有不同性别特定数据集的系统性能进行评估显示出显着的性能改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号