首页> 外文期刊>IEEE transactions on audio, speech and language processing >Regularization for Partial Multichannel Equalization for Speech Dereverberation
【24h】

Regularization for Partial Multichannel Equalization for Speech Dereverberation

机译:语音去混响的部分多通道均衡的正则化

获取原文
获取原文并翻译 | 示例

摘要

Acoustic multichannel equalization techniques such as the multiple-input/output inverse theorem (MINT), which aim to equalize the room impulse responses (RIRs) between the source and the microphone array, are known to be highly sensitive to RIR estimation errors. To increase robustness, it has been proposed to incorporate regularization in order to decrease the energy of the equalization filters. In addition, more robust partial multichannel equalization techniques such as relaxed multichannel least-squares (RMCLS) and channel shortening (CS) have recently been proposed. In this paper, we propose a partial multichannel equalization technique based on MINT (P-MINT) which aims to shorten the RIR. Furthermore, we investigate the effectiveness of incorporating regularization to further increase the robustness of P-MINT and the aforementioned partial multichannel equalization techniques, i.e., RMCLS and CS. In addition, we introduce an automatic non-intrusive procedure for determining the regularization parameter based on the L-curve. Simulation results using measured RIRs show that incorporating regularization in P-MINT yields a significant performance improvement in the presence of RIR estimation errors, whereas a smaller performance improvement is observed when incorporating regularization in RMCLS and CS. Furthermore, it is shown that the intrusively regularized P-MINT technique outperforms all other investigated intrusively regularized multichannel equalization techniques in terms of perceptual speech quality (PESQ). Finally, it is shown that the automatic non-intrusive regularization parameter in regularized P-MINT leads to a very similar performance as the intrusively determined optimal regularization parameter, making regularized P-MINT a robust, perceptually advantageous, and practically applicable multichannel equalization technique for speech dereverberation.
机译:诸如多输入/输出逆定理(MINT)之类的声学多通道均衡技术,其目的是均衡源和麦克风阵列之间的房间脉冲响应(RIR),对RIR估计误差高度敏感。为了增加鲁棒性,已经提出合并正则化以减少均衡滤波器的能量。另外,近来已经提出了更健壮的部分多信道均衡技术,例如松弛多信道最小二乘(RMCLS)和信道缩短(CS)。在本文中,我们提出了一种基于MINT(P-MINT)的局部多通道均衡技术,旨在缩短RIR。此外,我们研究了合并正则化以进一步提高P-MINT和上述部分多通道均衡技术(即RMCLS和CS)的鲁棒性的有效性。此外,我们介绍了一种基于L曲线来确定正则化参数的自动非侵入式过程。使用测得的RIR进行的仿真结果表明,在存在RIR估计误差的情况下,将正则化合并到P-MINT中可显着改善性能,而在RMCLS和CS中将正则化合并时,观察到较小的性能改进。此外,从感知语音质量(PESQ)的角度来看,表明侵入式正则化P-MINT技术优于所有其他研究的侵入式正则化多通道均衡技术。最后,证明了正则化P-MINT中的自动非侵入式正则化参数所产生的性能与侵入式确定的最佳正则化参数非常相似,从而使正则化P-MINT成为一种鲁棒的,感知上有利的并且在实践中适用的多通道均衡技术语音混响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号