首页> 外文期刊>Digital Signal Processing >Binaural source separation based on spatial cues and maximum likelihood model adaptation
【24h】

Binaural source separation based on spatial cues and maximum likelihood model adaptation

机译:基于空间线索和最大似然模型自适应的双源分离

获取原文
获取原文并翻译 | 示例
           

摘要

This paper describes a system for separating multiple moving sound sources from two-channel recordings based on spatial cues and a model adaptation technique. We employ a statistical model of observed interaural level and phase differences, where maximum likelihood estimation of model parameters is achieved through an expectation-maximization algorithm. This model is used to partition spectrogram points into several clusters (one cluster per source) and generate spectrogram masks accordingly for isolating individual sound sources. We follow a maximum likelihood linear regression (MLLR) approach for tracking source relocations and adapting model parameters accordingly. The proposed algorithm is able to separate more sources than input channels, i.e. in the underdetermined setting. In simulated anechoic and reverberant environments with two and three speakers, the proposed model-adaptation algorithm yields more than 10 dB gain in signal-to-noise-ratio-improvement for azimuthal source relocations of 15 degrees or more. Moreover, this performance gain is achievable with only 0.6 seconds of input mixture received after relocation. (C) 2014 Elsevier Inc. All rights reserved.
机译:本文介绍了一种基于空间线索和模型自适应技术从两个通道的录音中分离出多个移动声源的系统。我们采用观察到的听觉水平和相位差的统计模型,其中模型参数的最大似然估计是通过期望最大化算法实现的。该模型用于将频谱图点划分为几个群集(每个源一个群集),并相应地生成频谱图掩码,以隔离各个声源。我们遵循最大似然线性回归(MLLR)方法来跟踪源重定位并相应地调整模型参数。所提出的算法能够分离比输入通道更多的源,即在未确定的设置中。在带有两个和三个扬声器的模拟回声和混响环境中,对于15度或以上的方位源重定位,所提出的模型自适应算法在信噪比改善中产生的增益超过10 dB。而且,在重新定位后仅接收0.6秒的输入混合物,就可以实现这种性能提升。 (C)2014 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号