首页> 外文期刊>Research Journal of Applied Sciences: RJAS >Genetic Supervised Classification of Standard Arabic Fricative Consonants for the Automatic Speech Recognition
【24h】

Genetic Supervised Classification of Standard Arabic Fricative Consonants for the Automatic Speech Recognition

机译:自动语音识别的标准阿拉伯摩擦音的遗传监督分类

获取原文
           

摘要

The purpose of this study is the application of the Genetic Algorithms (GAs) to the supervised classification level, in order to recognize Standard Arabic (SA) fricative consonants of continuous, naturally spoken, speech. We have used GAs because of their advantages in resolving complicated optimization problems, where analytic methods fail. For that, we have analyzed a corpus that contains several sentences composed of the thirteen types of fricative consonants in the initial, medium and final positions, recorded by several male Jordanian speakers. Nearly all the world’s languages contain at least one fricative sound. The SA language occupies a rather exceptional position in that nearly half of it’s consonants is fricatives and nearly half of fricative inventory is situated far back in the uvular, pharyngeal and glottal areas. We have used Mel Frequency Cepstrum Coefficients (MFCCs) method to extract vocal tract coefficients from the speech signal. To represents temporal variations in the speech signal, the first and second derivatives of both MFCCs and energy are added to the set of static parameters. The acoustic segments classification and the GAs have been explored. Among a set of classifiers like Bayesian, likelihood and distance classifier, we have used the distance one. It is based on the classification measure criterion. So, we formulate the supervised classification as a function optimization problem and we have used the decision rule Mahalanobis distance as the fitness function for the GA evaluation. We report promising results with a classification recognition accuracy of 82%.
机译:这项研究的目的是将遗传算法(GA)应用于有监督的分类级别,以便识别连续的自然口语语音的标准阿拉伯语(SA)摩擦辅音。我们之所以使用GA,是因为它们在解决分析方法失败的复杂优化问题方面具有优势。为此,我们分析了一个语料库,该语料库由几位约旦男性演讲者记录,包含在初始,中间和最后位置由十三种摩擦辅音组成的几句话。世界上几乎所有语言都包含至少一种摩擦音。 SA语言占有相当特殊的位置,因为它的辅音中有近一半是摩擦词,而近一半的摩擦词则位于小舌,咽部和声门区域。我们已使用梅尔频率倒谱系数(MFCCs)方法从语音信号中提取声道系数。为了表示语音信号中的时间变化,将MFCC和能量的一阶和二阶导数添加到静态参数集。声学段分类和遗传算法已被探索。在贝叶斯,似然和距离分类器等一组分类器中,我们使用了距离分类器。它基于分类度量标准。因此,我们将监督分类表述为一个函数优化问题,并将决策规则马哈拉诺比斯距离作为适应度函数进行遗传算法评估。我们报告了有希望的结果,分类识别准确率达82%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号