首页> 外文期刊>EURASIP Journal on Audio, Speech, and Music Processing >An iterative model-based approach to cochannel speech separation
【24h】

An iterative model-based approach to cochannel speech separation

机译:基于迭代模型的同信道语音分离方法

获取原文
获取原文并翻译 | 示例
           

摘要

Cochannel speech separation aims to separate two speech signals from a single mixture. In a supervised scenario, the identities of two speakers are given, and current methods use pre-trained speaker models for separation. One issue in model-based methods is the mismatch between training and test signal levels. We propose an iterative algorithm to adapt speaker models to match the signal levels in testing. Our algorithm first obtains initial estimates of source signals using unadapted speaker models and then detects the input signal-to-noise ratio (SNR) of the mixture. The input SNR is then used to adapt the speaker models for more accurate estimation. The two steps iterate until convergence. Compared to search-based SNR detection methods, our method is not limited to given SNR levels. Evaluations demonstrate that the iterative procedure converges quickly in a considerable range of SNRs and improves separation results significantly. Comparisons show that the proposed system performs significantly better than related model-based systems.
机译:同频道语音分离旨在从单个混合物中分离两个语音信号。在有监督的情况下,给出了两个说话者的身份,并且当前的方法使用预先训练的说话者模型进行分离。基于模型的方法中的一个问题是训练和测试信号电平之间的不匹配。我们提出一种迭代算法,以调整扬声器模型以匹配测试中的信号电平。我们的算法首先使用不适合的扬声器模型获得源信号的初始估计,然后检测混合物的输入信噪比(SNR)。输入SNR然后用于调整扬声器模型,以进行更准确的估计。这两个步骤反复进行,直到收敛为止。与基于搜索的SNR检测方法相比,我们的方法不限于给定的SNR级别。评估表明,迭代过程在相当大的SNR范围内快速收敛,并显着改善了分离结果。比较表明,所提出的系统的性能明显优于相关的基于模型的系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号