首页> 外文期刊>International journal of speech technology >A comparative study for Arabic speech recognition system in noisy environments
【24h】

A comparative study for Arabic speech recognition system in noisy environments

机译:嘈杂环境中阿拉伯语语音识别系统的比较研究

获取原文
获取原文并翻译 | 示例
       

摘要

Speech recognition in noisy environments is one of the long-standing research themes but remains a very important challenge nowadays. Therefore, there is much research into all techniques and approaches to improve the performance of speech recognition systems, even in poor conditions. This paper presents a comparative study under various conditions based on two architectures (GMM-HMM and DNN-HMM), the Hybrid GMM-HMM models using the CMU Sphinx tools and the Hybrid DNN-HMM using the KALDI toolkit in noise environment. In this study, we compare the Hybrid GMM-HMM models and the Hybrid DNN-HMM models to evaluate the performance of the proposed system. The novelty of this paper is to test if the presented tools could be, with good accuracy, recognize the Arabic speech principally in noisy environment. In addition, we adopted the noisy training theory in this paper based on GMM-HMM and DNN-HMM model. We use the public Arabic Speech Corpus for Isolated Words (20 words), three noise levels, and three noise types. The implementation of our system consists of two phases: Features extraction using Mel-frequency Cepstral Coefficient (MFCC) and the classification phase will use separately the previous two models. In order to test the performance of these methods a simulation will presented for different SNR and for different district type of noise.
机译:嘈杂环境中的语音识别是一项长期的研究主题之一,但现在仍然是一个非常重要的挑战。因此,即使在差的条件下,所有技术和方法都有许多技术和方法来提高语音识别系统的性能。本文在基于两种架构(GMM-HMM和DNN-HMM)的各种条件下,使用CMU Sphinx工具和使用KALDI工具包中的混合GMM-HMM模型在噪声环境中使用CMU SPHINX工具和混合DNN-HMM的各种条件下进行了比较研究。在这项研究中,我们比较混合GMM-HMM模型和混合DNN-HMM模型来评估所提出的系统的性能。本文的新颖性是测试所提出的工具是否可能具有良好的准确性,主要在嘈杂的环境中识别阿拉伯语演讲。此外,我们在本文中采用了基于GMM-HMM和DNN-HMM模型的嘈杂培训理论。我们使用公共阿拉伯语语音语料库(20字),三个噪声水平和三种噪声类型。我们的系统的实施包括两个阶段:使用熔融频率谱系码(MFCC)提取特征,分类阶段将分别使用前两种型号。为了测试这些方法的性能,将为不同的SNR和不同的地区类型的噪声呈现模拟。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号