【24h】

BAYESIAN SEPARATION OF AUDIO-VISUAL SPEECH SOURCES

机译:贝叶斯分离视听语音源

获取原文

摘要

In this paper we investigate the use of audio and visual rather than only audio features for the task of speech separation in acoustically noisy environments. The success of existing independent component analysis (ICA) systems for the separation of a large variety of signals, including speech, is often limited by the ability of this technique to handle noise. In this paper, we introduce a Bayesian model for the mixing process that describes both the bimodality and the time dependency of speech sources. Our experimental results show that the online demixing process presented here outperforms both the ICA and the audio-only Bayesian model at all levels of noise.
机译:在本文中,我们调查了音频和视觉的使用,而不是在声学嘈杂环境中的语音分离任务的使用。用于分离各种信号的现有独立分量分析(ICA)系统的成功往往受到这种技术处理噪声的能力的限制。在本文中,我们介绍了一种贝叶斯模型,用于混合过程,所述混合过程描述了语音源的双极性和时间依赖性。我们的实验结果表明,这里展示了在这里的在线解泥过程优于所有噪声水平的ICA和唯一的贝叶斯模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号