首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Optimized Speech Dereverberation From Probabilistic Perspective for Time Varying Acoustic Transfer Function
【24h】

Optimized Speech Dereverberation From Probabilistic Perspective for Time Varying Acoustic Transfer Function

机译:从概率角度出发优化时变声传递函数的语音去混响

获取原文
获取原文并翻译 | 示例

摘要

A dereverberation technique has been developed that optimally combines multichannel inverse filtering (MIF), beamforming (BF), and non-linear reverberation suppression (NRS). It is robust against acoustic transfer function (ATF) fluctuations and creates less distortion than the NRS alone. The three components are optimally combined from a probabilistic perspective using a unified likelihood function incorporating two probabilistic models. A multichannel probabilistic source model based on a recently proposed local Gaussian model (LGM) provides robustness against ATF fluctuations of the early reflection. A probabilistic reverberant transfer function model (PRTFM) provides robustness against ATF fluctuations of the late reverberation. The MIF and multichannel under-determined source separation (MUSS) are optimized in an iterative manner. The MIF is designed to reduce the time-invariant part of the late reverberation by using optimal time-weighting with reference to the PRTFM and the LGM. The MUSS separates the dereverberated speech signal and the residual reverberation after the MIF, which can be interpreted as an optimized combination of the BF and the NRS. The parameters of the PRTFM and the LGM are optimized based on the MUSS output. Experimental results show that the proposed method is robust against the ATF fluctuations under both single and multiple source conditions.
机译:已经开发了一种去混响技术,该技术可以最佳地组合多通道逆滤波(MIF),波束形成(BF)和非线性混响抑制(NRS)。它对声学传递函数(ATF)波动具有鲁棒性,并且比单独使用NRS产生的失真小。使用结合了两个概率模型的统一似然函数,从概率角度将这三个组件最佳地组合在一起。基于最近提出的局部高斯模型(LGM)的多通道概率源模型提供了针对早期反射的ATF波动的鲁棒性。概率混响传递函数模型(PRTFM)可以抵抗后期混响的ATF波动。 MIF和多通道不确定源分离(MUSS)以迭代方式进行了优化。 MIF旨在通过参考PRTFM和LGM使用最佳时间加权来减少后期混响的时不变部分。 MUSS在MIF之后将去皮语音信号和残留混响分离,这可以解释为BF和NRS的优化组合。基于MUSS输出优化PRTFM和LGM的参数。实验结果表明,该方法在单源和多源条件下均能抵抗ATF波动。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号