...
首页> 外文期刊>IEICE transactions on information and systems >Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm
【24h】

Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm

机译:基于多通道LMS算法的频谱减法的远程谈话语音识别

获取原文

摘要

We propose a blind dereverberation method based on spectral subtraction using a multi-channel least mean squares (MCLMS) algorithm for distant-talking speech recognition. In a distant-talking environment, the channel impulse response is longer than the short-term spectral analysis window. By treating the late reverberation as additive noise, a noise reduction technique based on spectral subtraction was proposed to estimate the power spectrum of the clean speech using power spectra of the distorted speech and the unknown impulse responses. To estimate the power spectra of the impulse responses, a variable step-size unconstrained MCLMS (VSS-UMCLMS) algorithm for identifying the impulse responses in a time domain is extended to a frequency domain. To reduce the effect of the estimation error of the channel impulse response, we normalize the early reverberation by cepstral mean normalization (CMN) instead of spectral subtraction using the estimated impulse response. Furthermore, our proposed method is combined with conventional delay-and-sum beamforming. We conducted recognition experiments on a distorted speech signal simulated by convolving multi-channel impulse responses with clean speech. The proposed method achieved a relative error reduction rate of 22.4% in relation to conventional CMN. By combining the proposed method with beamforming, a relative error reduction rate of 24.5% in relation to the conventional CMN with beamforming was achieved using only an isolated word (with duration of about 0.6s) to estimate the spectrum of the impulse response.
机译:我们提出了一种基于使用多声道最少均方块(MCLMS)算法的频谱减法的盲人DERERATION方法,用于遥远的语音识别。在遥控环境中,信道脉冲响应长于短期谱分析窗口。通过将后期混响作为附加噪声,提出了一种基于光谱减法的降噪技术,估计使用扭曲语音的功率谱和未知的脉冲响应的功率谱来估计清洁语音的功率谱。为了估计脉冲响应的功率谱,用于识别时域中的脉冲响应的可变步长无约会MCLM(VSS-UMCLMS)算法扩展到频域。为了减少信道脉冲响应的估计误差的效果,我们通过使用估计的脉冲响应来规范临时归一化(CMN)而不是光谱减法的预先反振。此外,我们所提出的方法与传统的延迟和和波束形成相结合。我们对通过卷积多通道脉冲响应的扭曲语音信号进行了识别实验。所提出的方法与传统CMN相对于传统CMN相对误差降低率为22.4%。通过将提出的方法与波束成形的结合相结合,仅使用仅与孤立的字(具有约0.6s)的隔离字(具有约0.6s)的传统CMN的相对误差降低率为与波束成形相比。估计脉冲响应的频谱。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号