Optimized Speech Dereverberation From Probabilistic Perspective for Time Varying Acoustic Transfer Function

Togami M.; Kawaguchi Y.; Takeda R.; Obuchi Y.; Nukaga N.


Abstract

A dereverberation technique has been developed that optimally combines multichannel inverse filtering (MIF), beamforming (BF), and non-linear reverberation suppression (NRS). It is robust against acoustic transfer function (ATF) fluctuations and creates less distortion than the NRS alone. The three components are optimally combined from a probabilistic perspective using a unified likelihood function incorporating two probabilistic models. A multichannel probabilistic source model based on a recently proposed local Gaussian model (LGM) provides robustness against ATF fluctuations of the early reflection. A probabilistic reverberant transfer function model (PRTFM) provides robustness against ATF fluctuations of the late reverberation. The MIF and multichannel under-determined source separation (MUSS) are optimized in an iterative manner. The MIF is designed to reduce the time-invariant part of the late reverberation by using optimal time-weighting with reference to the PRTFM and the LGM. The MUSS separates the dereverberated speech signal and the residual reverberation after the MIF, which can be interpreted as an optimized combination of the BF and the NRS. The parameters of the PRTFM and the LGM are optimized based on the MUSS output. Experimental results show that the proposed method is robust against the ATF fluctuations under both single and multiple source conditions.
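The abstract's remark that the MUSS output "can be interpreted as an optimized combination of the BF and the NRS" can be illustrated with the generic multichannel Wiener filter that follows from a local Gaussian model. The sketch below is not the authors' implementation: it omits the MIF, the PRTFM, and the iterative parameter updates, and it assumes the time-varying variances and spatial covariance matrices are already known. The function name `lgm_wiener_separate` and all parameter names are hypothetical. Each component (for example, dereverberated speech and residual reverberation) is assumed to be zero-mean complex Gaussian with covariance `v_j(t) * R_j` in a single frequency bin, and the estimate is its posterior mean.

```python
import numpy as np

def lgm_wiener_separate(x, variances, spatial_covs, eps=1e-10):
    """Multichannel Wiener filter under a local Gaussian model (one frequency bin).

    x:            (T, M) complex observations, T frames and M microphones
    variances:    (J, T) nonnegative time-varying power of each component
    spatial_covs: (J, M, M) Hermitian positive semidefinite spatial covariances
    returns:      (J, T, M) posterior-mean estimate of each component's image
    """
    M = x.shape[1]
    # Per-frame covariance of each component, v_j(t) * R_j: shape (J, T, M, M)
    comp_cov = variances[:, :, None, None] * spatial_covs[:, None, :, :]
    # Covariance of the observed mixture per frame, with light diagonal loading
    mix_cov = comp_cov.sum(axis=0) + eps * np.eye(M)
    # Solve mix_cov[t] @ y[t] = x[t] for every frame: shape (T, M, 1)
    y = np.linalg.solve(mix_cov, x[:, :, None])
    # Posterior mean of component j: v_j(t) * R_j @ mix_cov[t]^{-1} @ x[t]
    est = comp_cov @ y[None, :, :, :]
    return est[..., 0]

# Synthetic example with assumed (random) model parameters
rng = np.random.default_rng(0)
T, M, J = 100, 3, 2
A = rng.standard_normal((J, M, M)) + 1j * rng.standard_normal((J, M, M))
spatial_covs = A @ A.conj().transpose(0, 2, 1) + np.eye(M)
variances = rng.uniform(0.1, 1.0, size=(J, T))
x = rng.standard_normal((T, M)) + 1j * rng.standard_normal((T, M))
est = lgm_wiener_separate(x, variances, spatial_covs)
```

In this form the spatial covariances play the role of a beamformer (they decide how the microphones are combined), while the time-varying variances act as a frame-dependent gain similar to non-linear suppression. Because the filters of all components sum to the identity up to the diagonal loading, the component estimates add back up to the observation.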


Bibliographic details

Source: Audio, Speech, and Language Processing, IEEE Transactions on, 2013, No. 7, pp. 1369-1380 (12 pages)
Authors: Togami M.; Kawaguchi Y.; Takeda R.; Obuchi Y.; Nukaga N.
Affiliation: Central Research Laboratory of Hitachi Ltd., Tokyo, Japan
Original format: PDF
Language: English
Keywords: Microphones; Nonlinear distortion; Probabilistic logic; Reverberation; Robustness; Speech; Transfer functions; Dereverberation; expectation-maximization algorithm; local Gaussian modeling; multichannel filtering; time-varying acoustic transfer function


