首页> 外文期刊>EURASIP journal on advances in signal processing >Separation of Audio-Visual Speech Sources: A New Approach Exploiting the Audio-Visual Coherence of Speech Stimuli
【24h】

Separation of Audio-Visual Speech Sources: A New Approach Exploiting the Audio-Visual Coherence of Speech Stimuli

机译:视听语音源分离:利用语音刺激视听连贯的新方法

获取原文
           

摘要

We present a new approach to the source separation problem in the case of multiple speech signals. The method is based on the use of automatic lipreading, the objective is to extract an acoustic speech signal from other acoustic signals by exploiting its coherence with the speaker′s lip movements. We consider the case of an additive stationary mixture of decorrelated sources, with no further assumptions on independence or non-Gaussian character. Firstly, we present a theoretical framework showing that it is indeed possible to separate a source when some of its spectral characteristics are provided to the system. Then we address the case of audio-visual sources. We show how, if a statistical model of the joint probability of visual and spectral audio input is learnt to quantify the audio-visual coherence, separation can be achieved by maximizing this probability. Finally, we present a number of separation results on a corpus of vowel-plosive-vowel sequences uttered by a single speaker, embedded in a mixture of other voices. We show that separation can be quite good for mixtures of 2, 3, and 5 sources. These results, while very preliminary, are encouraging, and are discussed in respect to their potential complementarity with traditional pure audio separation or enhancement techniques.
机译:我们提出了一种在多个语音信号情况下的源分离问题的新方法。该方法基于自动唇读的使用,目的是通过利用其与说话人嘴唇运动的一致性,从其他声音信号中提取声音语音信号。我们考虑去相关源的加法平稳混合的情况,没有关于独立性或非高斯性的进一步假设。首先,我们提出一个理论框架,表明当某些光源的光谱特性提供给系统时,确实有可能分离光源。然后,我们讨论视听资源的情况。我们展示了,如果学会了视觉和频谱音频输入联合概率的统计模型来量化视听连贯性,那么如何通过使该概率最大化来实现分离。最后,我们给出了由单个说话者发出的,混合在其他声音混合中的元音-爆破-元音序列语料库的许多分离结果。我们表明,对于2、3和5种来源的混合物,分离效果会很好。这些结果虽然非常初步,但却令人鼓舞,并且就其与传统纯音频分离或增强技术的潜在互补性进行了讨论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号