Conference: Advanced Concepts for Intelligent Vision Systems

Two-Level Bimodal Association for Audio-Visual Speech Recognition

Abstract

This paper proposes a new method for bimodal information fusion in audio-visual speech recognition in which cross-modal association is considered at two levels. First, the acoustic and visual data streams are combined at the feature level using canonical correlation analysis, which addresses audio-visual synchronization and exploits the cross-modal correlation. Second, the information streams are integrated at the decision level, so that fusion adapts to the noise condition of the given speech data. Experimental results demonstrate that the proposed method yields noise-robust recognition performance without a priori knowledge of the noise conditions of the speech data.
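The two fusion levels described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract gives no formulas, so the whitening-plus-SVD formulation of canonical correlation analysis, the regularization term `reg`, the concatenation of the projected streams, and the score-gap reliability heuristic used for the decision weight are all assumptions made here for illustration. The function names (`cca_fit`, `fuse_features`, `fuse_decisions`) are hypothetical.

```python
import numpy as np

def cca_fit(X, Y, k, reg=1e-6):
    """Fit CCA on paired frames X (n, dx) and Y (n, dy); keep k components.

    Returns the stream means, the projection matrices, and the canonical
    correlations. `reg` is an assumed ridge term for numerical stability.
    """
    mx, my = X.mean(axis=0), Y.mean(axis=0)
    Xc, Yc = X - mx, Y - my
    n = X.shape[0]
    Sxx = Xc.T @ Xc / (n - 1) + reg * np.eye(X.shape[1])
    Syy = Yc.T @ Yc / (n - 1) + reg * np.eye(Y.shape[1])
    Sxy = Xc.T @ Yc / (n - 1)
    # Whiten both streams with Cholesky factors: Sxx = Lx Lx^T, Syy = Ly Ly^T.
    Lx = np.linalg.cholesky(Sxx)
    Ly = np.linalg.cholesky(Syy)
    # T = Lx^{-1} Sxy Ly^{-T}; its singular values are the canonical correlations.
    T = np.linalg.solve(Ly, np.linalg.solve(Lx, Sxy).T).T
    U, s, Vt = np.linalg.svd(T, full_matrices=False)
    Wx = np.linalg.solve(Lx.T, U[:, :k])   # audio projection, shape (dx, k)
    Wy = np.linalg.solve(Ly.T, Vt[:k].T)   # visual projection, shape (dy, k)
    return mx, my, Wx, Wy, s[:k]

def fuse_features(X, Y, model):
    """Feature-level fusion (assumed form): concatenate the projected streams."""
    mx, my, Wx, Wy, _ = model
    return np.hstack([(X - mx) @ Wx, (Y - my) @ Wy])

def reliability(loglik):
    """Assumed heuristic: mean gap between the best class score and all scores.

    A flat score profile (small gap) suggests an unreliable, noisy stream.
    """
    loglik = np.asarray(loglik, dtype=float)
    return float(np.mean(loglik.max() - loglik))

def fuse_decisions(ll_a, ll_v):
    """Decision-level fusion: reliability-weighted sum of class log-likelihoods.

    Returns the index of the winning class and the weight given to stream A.
    """
    ll_a = np.asarray(ll_a, dtype=float)
    ll_v = np.asarray(ll_v, dtype=float)
    ra, rv = reliability(ll_a), reliability(ll_v)
    w = ra / (ra + rv) if (ra + rv) > 0 else 0.5
    fused = w * ll_a + (1.0 - w) * ll_v
    return int(np.argmax(fused)), w
```

In this sketch the weight is computed from the class scores of the utterance itself, so no noise level has to be supplied in advance; a learned mapping from reliability to weight could be substituted for the simple ratio.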
