Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm

Longbiao WANG; Norihide KITAOKA; Seiichi NAKAGAWA

首页> 外文期刊>IEICE transactions on information and systems >Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm

【24h】

Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm

机译：基于多通道LMS算法的频谱减法的远程谈话语音识别

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a blind dereverberation method based on spectral subtraction using a multi-channel least mean squares (MCLMS) algorithm for distant-talking speech recognition. In a distant-talking environment, the channel impulse response is longer than the short-term spectral analysis window. By treating the late reverberation as additive noise, a noise reduction technique based on spectral subtraction was proposed to estimate the power spectrum of the clean speech using power spectra of the distorted speech and the unknown impulse responses. To estimate the power spectra of the impulse responses, a variable step-size unconstrained MCLMS (VSS-UMCLMS) algorithm for identifying the impulse responses in a time domain is extended to a frequency domain. To reduce the effect of the estimation error of the channel impulse response, we normalize the early reverberation by cepstral mean normalization (CMN) instead of spectral subtraction using the estimated impulse response. Furthermore, our proposed method is combined with conventional delay-and-sum beamforming. We conducted recognition experiments on a distorted speech signal simulated by convolving multi-channel impulse responses with clean speech. The proposed method achieved a relative error reduction rate of 22.4% in relation to conventional CMN. By combining the proposed method with beamforming, a relative error reduction rate of 24.5% in relation to the conventional CMN with beamforming was achieved using only an isolated word (with duration of about 0.6s) to estimate the spectrum of the impulse response.

机译：我们提出了一种基于使用多声道最少均方块（MCLMS）算法的频谱减法的盲人DERERATION方法，用于遥远的语音识别。在遥控环境中，信道脉冲响应长于短期谱分析窗口。通过将后期混响作为附加噪声，提出了一种基于光谱减法的降噪技术，估计使用扭曲语音的功率谱和未知的脉冲响应的功率谱来估计清洁语音的功率谱。为了估计脉冲响应的功率谱，用于识别时域中的脉冲响应的可变步长无约会MCLM（VSS-UMCLMS）算法扩展到频域。为了减少信道脉冲响应的估计误差的效果，我们通过使用估计的脉冲响应来规范临时归一化（CMN）而不是光谱减法的预先反振。此外，我们所提出的方法与传统的延迟和和波束形成相结合。我们对通过卷积多通道脉冲响应的扭曲语音信号进行了识别实验。所提出的方法与传统CMN相对于传统CMN相对误差降低率为22.4％。通过将提出的方法与波束成形的结合相结合，仅使用仅与孤立的字（具有约0.6s）的隔离字（具有约0.6s）的传统CMN的相对误差降低率为与波束成形相比。估计脉冲响应的频谱。

著录项

来源
《IEICE transactions on information and systems 》 |2011年第3期| 共9页
作者
Longbiao WANG; Norihide KITAOKA; Seiichi NAKAGAWA;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm [J] . Longbiao WANG, Norihide KITAOKA, Seiichi NAKAGAWA IEICE Transactions on Information and Systems . 2011 ,第3期

机译：基于多通道LMS算法的谱相减的远距离语音识别
2. Dereverberation and denoising based on generalized spectral subtraction by multi-channel LMS algorithm using a small-scale microphone array [J] . Longbiao Wang, Kyohei Odani, Atsuhiko Kai EURASIP journal on advances in signal processing . 2012 ,第1期

机译：基于小规模麦克风阵列的多通道LMS算法基于广义谱减法的混响和去噪
3. Dual-channel spectral subtraction algorithms based speech enhancement dedicated to a bilateral cochlear implant [J] . Fathi Kallel, Mondher Frikha, Mohamed Ghorbel, Applied Acoustics . 2012 ,第1期

机译：基于双通道频谱减法的语音增强专用于双侧耳蜗植入
4. Evaluation of Hands-Free Large Vocabulary Continuous Speech Recognition by Blind Dereverberation Based on Spectral Subtraction by Multi-channel LMS Algorithm [C] . Longbiao Wang, Kyohei Odani, Atsuhiko Kai Text, speech and dialogue . 2011

机译：基于谱减法的多通道LMS算法盲去混响评估免提大词汇量连续语音识别
5. Feature-based speech enhancement techniques based on spectral subtraction and Wiener filtering [D] . Chan, Mike Veng-Hang 1999

机译：基于频谱减法和维纳滤波的基于特征的语音增强技术
6. EEG Signal Description with Spectral-Envelope-Based Speech Recognition Features for Detection of Neonatal Seizures [O] . Andriy Temko, Climent Nadeu, William Marnane, -1

机译：EEG信号描述与基于光谱包络的语音识别特征用于检测新生儿癫痫发作
7. Dereverberation and denoising based on generalized spectral subtraction by multi-channel LMS algorithm using a small-scale microphone array [O] . Longbiao Wang, Kyohei Odani, Atsuhiko Kai 2012

机译：基于小规模麦克风阵列的多通道LMS算法基于广义谱减法的混响和去噪

Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm

摘要

著录项

相似文献

相关主题

期刊订阅