IEEE Transactions on Audio, Speech and Language Processing

Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures

Abstract

Looking at the speaker's face can help a listener hear a speech signal in a noisy environment and extract it from competing sources before identification. This suggests that the visual signals of speech (movements of the visible articulators) could be used in speech enhancement or extraction systems. In this paper, we present a novel algorithm that plugs the audiovisual coherence of speech signals, estimated by statistical tools, into audio blind source separation (BSS) techniques. The algorithm is applied to the difficult and realistic case of convolutive mixtures. It works mainly in the frequency (transform) domain, where the convolutive mixture becomes an additive mixture for each frequency channel. Separation is performed frequency by frequency by an audio BSS algorithm. The audio and visual information is modeled by a newly proposed statistical model, which is then used to resolve the standard source permutation and scale factor ambiguities encountered at each frequency after the audio blind separation stage. The proposed method is shown to be efficient in the case of 2 × 2 convolutive mixtures and offers promising perspectives for extracting a particular speech source of interest from complex mixtures.
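
The two properties the abstract relies on can be illustrated with a minimal NumPy sketch. It is not the authors' algorithm and contains no audiovisual model. It only shows, under simplifying assumptions, that a 2 × 2 convolutive mixture turns into one linear mixture per frequency bin, and that a separating matrix for a bin is determined only up to a permutation and a scale factor. The function names, filter lengths, and random signals are hypothetical, and circular convolution is assumed so that the frequency-domain identity holds exactly (with a windowed short-time transform it would hold only approximately).

```python
import numpy as np


def circular_convolutive_mix(sources, filters):
    """Time-domain mixture x_i = sum_j h_ij (*) s_j with circular convolution.

    sources: array of shape (n_src, n_samples)
    filters: array of shape (n_mic, n_src, n_taps)
    """
    n_mic, n_src, _ = filters.shape
    mixed = np.zeros((n_mic, sources.shape[1]))
    for i in range(n_mic):
        for j in range(n_src):
            for k, tap in enumerate(filters[i, j]):
                # np.roll delays the source by k samples (circularly)
                mixed[i] += tap * np.roll(sources[j], k)
    return mixed


def per_frequency_mix(sources, filters):
    """The same mixture computed bin by bin: X(f) = H(f) S(f)."""
    n_samples = sources.shape[1]
    S = np.fft.fft(sources, axis=1)                # (n_src, n_freq)
    H = np.fft.fft(filters, n=n_samples, axis=2)   # (n_mic, n_src, n_freq)
    X = np.einsum("ijf,jf->if", H, S)              # one matrix product per bin
    return X, H, S


def demix_bin(H_f, perm, scale):
    """One of many valid separating matrices for a bin: W = D P H(f)^-1."""
    P = np.eye(len(perm))[perm]   # permutation matrix
    D = np.diag(scale)            # arbitrary per-output scale factors
    return D @ P @ np.linalg.inv(H_f)


# Hypothetical test signals: two random sources, random 4-tap mixing filters.
rng = np.random.default_rng(0)
s = rng.standard_normal((2, 64))
h = rng.standard_normal((2, 2, 4))

x = circular_convolutive_mix(s, h)
X, H, S = per_frequency_mix(s, h)

# The convolutive mixture is a plain linear mixture in each frequency bin.
assert np.allclose(np.fft.fft(x, axis=1), X)

# In bin f, this W also separates the sources, but swapped and rescaled.
f = 5
W = demix_bin(H[:, :, f], perm=[1, 0], scale=[2.0, -0.5])
Y = W @ X[:, f]
assert np.allclose(Y, [2.0 * S[1, f], -0.5 * S[0, f]])
```

Because a blind criterion cannot tell `W` apart from the plain inverse of `H(f)`, the output order and gain may differ from one bin to the next. This is the ambiguity that the abstract says is resolved with the audiovisual statistical model.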
