首页> 外文会议>International Conference on Sciences of Electronics, Technologies of Information and Telecommunications >Arabic Speech Recognition by Bionic Wavelet Transform and MFCC using a Multi Layer Perceptron
【24h】

Arabic Speech Recognition by Bionic Wavelet Transform and MFCC using a Multi Layer Perceptron

机译:使用仿生小波变换和MFCC的多层感知器识别阿拉伯语语音

获取原文

摘要

In this paper, we have proposed a new technique of Arabic Speech Recognition (ASR) with monolocutor and a reduced vocabulary. This technique consists at first step in using our proper speech database containing Arabic speech words which are recorded by a mono-locutor. The second step consists in features extracting from those recorded words. The third step is to classify those extracted features. This extraction is performed by computing at first step, the Mel Frequency Cepstral Coefficients (MFCCs) from each recorded word, then the Bionic Wavelet Transform (BWT) is applied to the vector obtained from the concatenation of the computed MFCCs. The obtained bionic wavelet coefficients are then concatenated to construct one input of a Multi-Layer Perceptual (MLP) used for features classification. In the MLP learning and test phases, we have used eleven Arabic words and each of them is repeated twenty five times by the same locutor. A simulation program is performed to test the performance of the proposed technique and shows a classification rate equals to 99.39%. We have also introduced a module of denoising as a phase of preprocessing. In this denoising module, we have treated the case of white noise and we have used the Wiener filtering. In case of SNR=5dB, the obtained recognition rate is equals to 78.7% and in case of SNR=10dB, it is equals to 93.9%.
机译:在本文中,我们提出了一种新的阿拉伯语语音识别(ASR)技术,它具有单定位符和减少的词汇量。这项技术的第一步是使用我们的适当语音数据库,其中包含阿拉伯语语音单词,这些单词由一个单一的讲者记录。第二步包括从那些记录的单词中提取特征。第三步是对那些提取的特征进行分类。通过在第一步中计算每个记录的单词的梅尔频率倒谱系数(MFCC),然后将仿生小波变换(BWT)应用于从计算的MFCC的级联中获得的矢量,来执行此提取。然后将获得的仿生子波系数连接起来,以构建用于特征分类的多层感知器(MLP)的一个输入。在MLP学习和测试阶段,我们使用了11个阿拉伯语单词,并且同一位讲师重复了25个单词。仿真程序被执行以测试所提出的技术的性能,并且显示出分类率等于99.39%。我们还引入了去噪模块作为预处理的一个阶段。在该降噪模块中,我们处理了白噪声的情况,并使用了维纳滤波。在SNR = 5dB的情况下,获得的识别率等于78.7%;在SNR = 10dB的情况下,获得的识别率等于93.9%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号