Conference: International Symposium on Chinese Spoken Language Processing

Robust front-end for speech recognition by human and machine in noisy reverberant environments: The effect of phase information

Abstract

This paper proposes a robust front-end for speech applications based on a scheme for restoring instantaneous amplitude and phase. Typical applications such as hearing aids and automatic speech recognition systems still face challenges with robustness against noise and reverberation. The proposed front-end combines our previously proposed method for restoring instantaneous amplitude and phase on a Gammatone filterbank with cepstral mean normalization (CMN). The first method can remove late reverberation and additive noise components from the observed speech, while the second can remove early reflections. We compared the proposed method with other typical methods as a robust front-end for speech recognition by humans and machines in noisy reverberant environments. Modified Rhyme Tests and word recognition tests were carried out to evaluate speech recognition by humans and by machines, respectively. The results of both evaluations showed that the proposed front-end effectively improves speech intelligibility scores and word recognition rates in noisy reverberant environments. In addition, phase information was found to greatly improve the quality and intelligibility of speech.
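
The CMN step named in the abstract can be illustrated with a minimal sketch. This is the generic textbook form of cepstral mean normalization, not the authors' implementation: the function name, the toy feature values, and the fixed per-coefficient offset standing in for a short channel response are all assumptions made for illustration. The idea is that a short convolutional distortion appears roughly as a constant offset in the cepstral domain, so subtracting the per-coefficient mean over time removes it.

```python
def cepstral_mean_normalization(frames):
    """Subtract the per-coefficient mean over time.

    frames: list of per-frame cepstral vectors (lists of floats, equal length).
    Returns a new list of mean-normalized vectors.
    """
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    # Mean of each cepstral coefficient across all frames of the utterance.
    means = [sum(f[k] for f in frames) / n_frames for k in range(n_coeffs)]
    return [[f[k] - means[k] for k in range(n_coeffs)] for f in frames]


# Hypothetical toy data: three frames with two cepstral coefficients each.
clean = [[1.0, 2.0], [3.0, 0.0], [2.0, 4.0]]
# Assumed constant cepstral offset modeling a short channel response.
offset = [0.5, -1.0]
observed = [[c + o for c, o in zip(frame, offset)] for frame in clean]

normalized = cepstral_mean_normalization(observed)
reference = cepstral_mean_normalization(clean)
```

In this toy case `normalized` and `reference` agree, because a constant offset shifts the mean by the same amount and cancels on subtraction. Real reverberation is only approximately constant across frames, which is consistent with the abstract pairing CMN with a separate method for late reverberation.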