首页> 外文期刊>IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences >Harmonicity Based Dereverberation for Improving Automatic Speech Recognition Performance and Speech Intelligibility
【24h】

Harmonicity Based Dereverberation for Improving Automatic Speech Recognition Performance and Speech Intelligibility

机译:基于和声的混响技术可提高自动语音识别性能和语音清晰度

获取原文
获取原文并翻译 | 示例
           

摘要

A speech signal captured by a distant microphone is generally smeared by reverberation, which severely degrades both the speech intelligibility and Automatic Speech Recognition (ASR) performance. Previously, we proposed a single-microphone dereverberation method, named "Harmonicity based dEReverBeration (HERB)." HERB estimates the inverse filter for an unknown room transfer function by utilizing an essential feature of speech, namely harmonic structure. In previous studies, improvements in speech intelligibility was shown solely with spectrograms, and improvements in ASR performance were simply confirmed by matched condition acoustic model. In this paper, we undertook a further investigation of HERB's potential as regards to the above two factors. First, we examined speech intelligibility by means of objective indices. As a result, we found that HERB is capable of improving the speech intelligibility to approximately that of clean speech. Second, since HERB alone could not improve the ASR performance sufficiently, we further analyzed the HERB mechanism with a view to achieving further improvements. Taking the analysis results into account, we proposed an appropriate ASR configuration and conducted experiments. Experimental results confirmed that, if HERB is used with an ASR adaptation scheme such as MLLR and a multicondition acoustic model, it is very effective for improving ASR performance even in unknown severely reverberant environments.
机译:远处的麦克风捕获的语音信号通常会被混响拖尾,这会严重降低语音清晰度和自动语音识别(ASR)性能。以前,我们提出了一种单麦克风混响方法,称为“基于谐波的dEReverBeration(HERB)”。 HERB通过利用语音的基本特征(即谐波结构)来估计未知房间传递函数的逆滤波器。在先前的研究中,语音清晰度的改善仅通过频谱图显示,而ASR性能的改善仅通过匹配条件声学模型确认。在本文中,我们针对上述两个因素对HERB的潜力进行了进一步的研究。首先,我们通过客观指标研究了语音清晰度。结果,我们发现HERB能够将语音清晰度提高到近似纯净语音。其次,由于仅HERB不能充分改善ASR性能,因此我们进一步分析了HERB机制以实现进一步的改进。考虑到分析结果,我们提出了合适的ASR配置并进行了实验。实验结果证实,如果将HERB与ASR自适应方案(例如MLLR)和多条件声学模型一起使用,则即使在未知的严重混响环境中,HERB对提高ASR性能也非常有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号