首页> 外文期刊>International journal of speech technology >Advanced classification approach for neuronal phoneme recognition system based on efficient constructive training algorithm
【24h】

Advanced classification approach for neuronal phoneme recognition system based on efficient constructive training algorithm

机译:基于有效构造训练算法的神经元音素识别系统高级分类方法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper introduces a neural network optimization procedure allowing the generation of multilayer per-ceptron (MLP) network topologies with few connections, low complexity and high classification performance for phoneme's recognition. An efficient constructive algorithm with incremental training using a new proposed Frame by Frame Neural Networks (FFNN) classification approach for automatic phoneme recognition is thus proposed. It is based on a novel recruiting hidden neuron's procedure for a single hidden-layer. After an initializing phase started with initial small number of hidden neurons, this algorithm allows the Neural Networks (NNs) to adjust automatically its parameters during the training phase. The modular FFNN classification method is then constructed and tested to recognize 5 broad phonetic classes extracted from the TIMIT database. In order to take into account the speech variability related to the coarticulation effect, a Context Window of Three Successive Frame's (CWTSF) analysis is applied. Although, an important reduction of the computational training time is observed, this technique penalized the overall Phone Recognition Rate (PRR) and increased the complexity of the recognition system. To alleviate these limitations, two feature dimensionality reduction techniques respectively based on Principal Component Analysis (PCA) and Self Organizing Maps (SOM) are investigated. It is observed an important improvement in the performance of the recognition system when the PCA technique is applied. Optimal neuronal phone recognition architecture is finally derived according to the following criteria: best PRR, minimum computational training time and complexity of the BPNN architecture.
机译:本文介绍了一种神经网络优化程序,该程序允许生成连接数少,复杂度低和分类性能高的多层每个感知器(MLP)网络拓扑,以用于音素识别。因此,提出了一种有效的增量训练有效构造算法,该算法采用新提出的逐帧神经网络(FFNN)分类方法进行自动音素识别。它基于一种新颖的针对单个隐藏层的隐藏神经元募集程序。在初始阶段以最初的少量隐藏神经元开始之后,该算法允许神经网络(NN)在训练阶段自动调整其参数。然后构建模块化FFNN分类方法并进行测试,以识别从TIMIT数据库提取的5种广泛的语音分类。为了考虑与协同发音效果相关的语音可变性,应用了三个连续帧的上下文窗口(CWTSF)分析。尽管可以观察到计算训练时间的显着减少,但该技术不利于整体电话识别率(PRR),并增加了识别系统的复杂性。为了减轻这些限制,研究了分别基于主成分分析(PCA)和自组织图(SOM)的两种特征降维技术。当应用PCA技术时,可以观察到识别系统性能的重要提高。最终根据以下标准得出最佳的神经元电话识别架构:最佳PRR,最小的计算训练时间和BPNN架构的复杂性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号