首页> 外文会议>Mexican International Conference on Artificial Intelligence >Using Adaptive Filter and Wavelets to Increase Automatic Speech Recognition Rate in Noisy Environment
【24h】

Using Adaptive Filter and Wavelets to Increase Automatic Speech Recognition Rate in Noisy Environment

机译:使用自适应滤波器和小波来提高嘈杂环境中的自动语音识别率

获取原文

摘要

This paper shows results obtained in the Automatic Speech Recognition (ASR) task for a corpus of digits speech files with a determinate noise level immerse. In the experiments, we used several speech files that contained Gaussian noise. We used HTK (Hidden Markov Model Toolkit) software of Cambridge University in the experiments. The noise level added to the speech signals was varying from fifteen to forty dB increased by a step of 5 units. We used an adaptive filtering to reduce the level noise (it was based in the Least Measure Square -LMS- algorithm) and two different wavelets (Haar and Daubechies). With LMS we obtained an error rate lower than if it was not present and it was better than wavelets employed for this experiment of Automatic Speech Recognition. For decreasing the error rate we trained with 50% of contaminated and originals signals to the ASR system. The results showed in this paper are focused to try analyses the ASR performance in a noisy environment and to demonstrate that if we are controlling the noise level and if we know the application where it is going to work, then we can obtain a better response in the ASR tasks. Is very interesting to count with these results because speech signal that we can find in a real experiment (extracted from an environment work, i.e.), could be treated with these technique and we can decrease the error rate obtained. Finally, we report a recognition rate of 99%, 97.5% 96%, 90.5%, 81% and 78.5% obtained from 15, 20, 25, 30, 35 and 40 noise levels, respectively when the corpus mentioned before was employed and LMS algorithm was used. Haar wavelet level 1 reached up the most important results as an alternative to LMS algorithm, but only when the noise level was 40 dB and using original corpus.
机译:本文示出了在数字语音文件的自动语音识别(ASR)任务中获得的结果,其具有确定噪声水平浸没的数字语音文件。在实验中,我们使用了包含高斯噪声的几个语音文件。我们在实验中使用了剑桥大学的HTK(隐马尔可夫模型Toolkit)软件。添加到语音信号的噪声水平从十五到四十dB变化,增加了5个单位的步骤。我们使用了自适应滤波来降低级别噪声(它基于最小量度的方形 - 算法)和两个不同的小波(Haar和Daubechies)。使用LMS,我们获得的错误率低于不存在的错误率,并且优于用于该自动语音识别实验的小波。为了降低我们培训的错误率,以50%的污染和原稿信号到ASR系统。本文显示的结果集中于尝试分析嘈杂环境中的ASR性能,并证明如果我们控制噪声水平,如果我们知道要工作的应用程序,那么我们可以获得更好的回应ASR任务。与这些结果有很有意思,因为我们可以在真实实验中找到的语音信号(从环境工作中提取,即,即,即),可以用这些技术对待,我们可以降低所获得的错误率。最后,当使用之前提到的语料库和LMS时,我们报告了从15,20,25,30,35和40个噪声水平获得的99%,97.5%96%,90.5%,81%和78.5%的识别率。使用算法。 Haar小波级别1将最重要的结果作为LMS算法的替代方案达到,但只有当噪声水平为40 dB并使用原始语料库时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号