首页> 外文会议>International Conference on Intelligent and Advanced Systems >Evaluation of Modified Deep Neural Network Architecture Performance for Speech Recognition
【24h】

Evaluation of Modified Deep Neural Network Architecture Performance for Speech Recognition

机译:语音识别的改进型深度神经网络架构性能评估

获取原文
获取外文期刊封面目录资料

摘要

Recently, Deep Neural Networks (DNN) has been widely used for pattern recognition and classification applications because of its high accuracy. Here in this paper, we propose four different Deep Neural Network (DNN) architectures and comparison is made between these four proposed DNN architectures in terms of accuracy and training time. The proposed DNN models are evaluated for speech recognition application using TIDIGITS corpus. Mel-Frequency Cepstral Coefficients (MFCC) technique is used to extract feature vectors of speech data. It is observed that modified triangular architecture gave the highest accuracy of 99.31 % as compared to other architectures while the triangular architecture gave the least training time of 49.72 sec. Furthermore, results of proposed DNN architecture is compared with the existing Hidden Markov Model based speech recognition and the proposed DNN provide an increased accuracy of 2.33%.
机译:近年来,由于深度神经网络(DNN)的高准确性,已被广泛用于模式识别和分类应用。在本文中,我们提出了四种不同的深度神经网络(DNN)架构,并在准确性和训练时间方面对这四种提出的DNN架构进行了比较。使用TIDIGITS语料对所提出的DNN模型进行语音识别应用评估。梅尔频率倒谱系数(MFCC)技术用于提取语音数据的特征向量。可以看出,与其他架构相比,修改后的三角形架构提供了最高的99.31%的精度,而三角形架构则提供了49.72 sec的最少训练时间。此外,将提出的DNN体系结构的结果与现有的基于隐马尔可夫模型的语音识别进行了比较,提出的DNN提供了2.33%的提高的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号