首页> 外文期刊>IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences >Multichannel Two-Stage Beamforming with Unconstrained Beamformer and Distortion Reduction
【24h】

Multichannel Two-Stage Beamforming with Unconstrained Beamformer and Distortion Reduction

机译:无约束波束形成器和减少失真的多通道两阶段波束形成

获取原文
获取原文并翻译 | 示例

摘要

This paper proposes a novel multichannel speech enhancement technique for reverberant rooms that is effective when noise sources are spatially stationary, such as a projector fan noise, an air-conditioner noise, and unwanted speech sources at the back of microphones. Speech enhancement performance of the conventional multichannel Wiener filter (MWF) degrades when the Signal-to-Noise Ratio (SNR) of the current microphone input signal changes from the noise-only period. Furthermore, the MWF structure is computationally inefficient, because the MWF updates the whole spatial beamformer periodically to track switching of the speakers (e.g. turn-taking). In contrast to the MWF, the proposed method reduces noise independently of the SNR. The proposed method has a novel two-stage structure, which reduces noise and distortion of the desired source signal in a cascade manner by using two different beam-formers. The first beamformer focuses on noise reduction without any constraint on the desired source, which is insensitive to SNR variation. However, the output signal after the first beamformer is distorted. The second beamformer focuses on distortion reduction of the desired source signal. Theoretically, complete elimination of distortion is assured. Additionally, the proposed method has a computationally efficient structure optimized for spatially stationary noise reduction problems. The first beamformer is updated only when the speech enhancement system is initialized. Only the second beamformer is updated periodically to track switching of the active speaker. The experimental results indicate that the proposed method can reduce spatially stationary noise source signals effectively with less distortion of the desired source signal even in a reverberant conference room.
机译:本文提出了一种适用于混响室的新型多通道语音增强技术,该技术在噪声源在空间上固定时有效,例如投影仪风扇噪声,空调噪声以及麦克风背面的有害语音源。当当前麦克风输入信号的信噪比(SNR)从仅噪声周期变化时,常规多通道Wiener滤波器(MWF)的语音增强性能会下降。此外,由于MWF周期性地更新整个空间波束形成器以跟踪扬声器的切换(例如,转弯),因此MWF结构在计算上效率低下。与MWF相比,该方法可以独立于SNR降低噪声。所提出的方法具有新颖的两级结构,该结构通过使用两个不同的波束形成器以级联方式降低了噪声和所需源信号的失真。第一个波束形成器专注于降低噪声,而对所需信号源没有任何限制,这对SNR变化不敏感。但是,第一波束形成器之后的输出信号会失真。第二个波束形成器专注于减少所需源信号的失真。从理论上讲,可以确保完全消除失真。另外,所提出的方法具有针对空间平稳的降噪问题而优化的计算有效结构。仅在语音增强系统初始化时更新第一波束形成器。仅第二波束形成器被周期性地更新以跟踪有源扬声器的切换。实验结果表明,所提出的方法即使在混响会议室中,也可以有效地减小空间平稳的噪声源信号,且所需源信号的失真较小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号