BAYESIAN SEPARATION OF AUDIO-VISUAL SPEECH SOURCES

机译：贝叶斯分离视听语音源

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we investigate the use of audio and visual rather than only audio features for the task of speech separation in acoustically noisy environments. The success of existing independent component analysis (ICA) systems for the separation of a large variety of signals, including speech, is often limited by the ability of this technique to handle noise. In this paper, we introduce a Bayesian model for the mixing process that describes both the bimodality and the time dependency of speech sources. Our experimental results show that the online demixing process presented here outperforms both the ICA and the audio-only Bayesian model at all levels of noise.

机译：在本文中，我们调查了音频和视觉的使用，而不是在声学嘈杂环境中的语音分离任务的使用。用于分离各种信号的现有独立分量分析（ICA）系统的成功往往受到这种技术处理噪声的能力的限制。在本文中，我们介绍了一种贝叶斯模型，用于混合过程，所述混合过程描述了语音源的双极性和时间依赖性。我们的实验结果表明，这里展示了在这里的在线解泥过程优于所有噪声水平的ICA和唯一的贝叶斯模型。

著录项

来源
《IEEE International Conference on Acoustics, Speech, and Signal Processing》|2004年||共4页
会议地点
作者
Shyamsundar Rajaram; Ara V. Nefian; Thomas S. Huang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词

相似文献

外文文献
中文文献
专利

1. Separation of Audio-Visual Speech Sources: A New Approach Exploiting the Audio-Visual Coherence of Speech Stimuli [J] . David Sodoyer, Jean-Luc Schwartz, Laurent Girin, EURASIP journal on advances in signal processing . 2002,第11期

机译：视听语音源分离：利用语音刺激视听连贯的新方法
2. Developing an audio-visual speech source separation algorithm [J] . Sodoyer D, Girin L, Jutten C, Speech Communication . 2004,第1a4期

机译：开发视听语音源分离算法
3. Audio-Visual Tibetan Speech Recognition Based on a Deep Dynamic Bayesian Network for Natural Human Robot Interaction [J] . Yue Zhao, Hui Wang, Qiang Ji International Journal of Advanced Robotic Systems . 2012,第6期

机译：基于深度动态贝叶斯网络的自然人机交互视听藏语语音识别
4. BAYESIAN SEPARATION OF AUDIO-VISUAL SPEECH SOURCES [C] . Shyamsundar Rajaram, Ara V. Nefian, Thomas S. Huang IEEE International Conference on Acoustics, Speech, and Signal Processing . 2004

机译：贝叶斯分离视听语音源
5. On the separation of T Tauri star spectra using non-negative matrix factorization and Bayesian positive source separation. [D] . Kenney, Colleen. 2010

机译：关于使用非负矩阵分解和贝叶斯正源分离的T Tauri星光谱的分离。
6. Audio-visual Resources for Hypertension Education and Audio-visual Resources for Diabetes Education [O] . Fran Bischoff 1982

机译：高血压教育视听资源和糖尿病教育视听资源
7. Separation of Audio-Visual Speech Sources: A New Approach Exploiting the Audio-Visual Coherence of Speech Stimuli [O] . David Sodoyer, Jean-Luc Schwartz, Laurent Girin, 2002

机译：视听语音源的分离：利用语音刺激的视听连贯的新方法

BAYESIAN SEPARATION OF AUDIO-VISUAL SPEECH SOURCES

摘要

著录项

相似文献

相关主题

期刊订阅