首页> 外文会议>IEEE International Conference on Intelligent Computer Communication and Processing >Improving automatic speech recognition in noise by energy normalization and signal resynthesis
【24h】

Improving automatic speech recognition in noise by energy normalization and signal resynthesis

机译:通过能量归一化提高噪声的自动语音识别和信号重新合作

获取原文

摘要

This paper presents the contribution of energy normalization technique in automatic speech recognition in babble noise, where machine assumes that speech and noise have the same level of energy, therefore loudness. Similarly, loudness of target speech and noise is an important contributing factor while recognizing speech by humans in everyday conditions. Louder speech is better recognized than non louder speech by humans, even if they are approaching to the listeners at a same signal to noise ratio (SNR). This phenomenon has been tested over the machines and the recognition performance roughly varies from 75% to 90% across a wide range of SNRs. In exchange, human recognition performance is more SNR-dependent: it varies from 30% to 95%. By using energy normalization, the machines have a poor recognition rate in average in comparison to the performance of humans in less noisy conditions (positive SNR), but tend to outperform humans in high noisy conditions (negative SNR like −4dB, −6dB). It is also confirmed by this study that formant processing has no significant effect in recognizing speech in noise. Subsequently, it implies that formant based vocal tract length normalization is unable to improve the performance of machines in noise.
机译:本文介绍了能量标准化技术在禁止噪声中自动语音识别中的贡献,其中机器假定语音和噪声具有相同的能量水平,因此响度。同样,目标语音和噪音的响度是一个重要的贡献因素,同时识别日常条件中的人类的言论。即使它们在与相同的信噪比(SNR)处的听众接近听众的情况下,越来越响亮的言论更好地被人类的言论更好地认可。这种现象已经过测试,识别性能大致不同于各种SNR的75%至90%。在换货上,人类认可性能更加SNR依赖性:它从30%达到9%到95%。通过使用能量标准化,与人类在较少嘈杂的条件下(阳性SNR)的性能相比,机器平均具有差的识别率差,但倾向于以高噪声条件(阴性SNR为-4dB,-6dB)来优于人类。通过这项研究还证实了,即在噪声识别噪音方面没有显着影响。随后,它意味着基于格式的基于的声带长度标准化无法改善噪声中机器的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号