Evaluation of Modified Deep Neural Network Architecture Performance for Speech Recognition

机译：语音识别的改进型深度神经网络架构性能评估

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Recently, Deep Neural Networks (DNN) has been widely used for pattern recognition and classification applications because of its high accuracy. Here in this paper, we propose four different Deep Neural Network (DNN) architectures and comparison is made between these four proposed DNN architectures in terms of accuracy and training time. The proposed DNN models are evaluated for speech recognition application using TIDIGITS corpus. Mel-Frequency Cepstral Coefficients (MFCC) technique is used to extract feature vectors of speech data. It is observed that modified triangular architecture gave the highest accuracy of 99.31 % as compared to other architectures while the triangular architecture gave the least training time of 49.72 sec. Furthermore, results of proposed DNN architecture is compared with the existing Hidden Markov Model based speech recognition and the proposed DNN provide an increased accuracy of 2.33%.

机译：近年来，由于深度神经网络（DNN）的高准确性，已被广泛用于模式识别和分类应用。在本文中，我们提出了四种不同的深度神经网络（DNN）架构，并在准确性和训练时间方面对这四种提出的DNN架构进行了比较。使用TIDIGITS语料对所提出的DNN模型进行语音识别应用评估。梅尔频率倒谱系数（MFCC）技术用于提取语音数据的特征向量。可以看出，与其他架构相比，修改后的三角形架构提供了最高的99.31％的精度，而三角形架构则提供了49.72 sec的最少训练时间。此外，将提出的DNN体系结构的结果与现有的基于隐马尔可夫模型的语音识别进行了比较，提出的DNN提供了2.33％的提高的准确性。

著录项

来源
《International Conference on Intelligent and Advanced Systems》|2018年|1-5|共5页
会议地点
作者
Md Amaan Haque; John Sahaya Rani Alex; Nithya Venkatesan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Neurons; Computer architecture; Speech recognition; Hidden Markov models; Biological neural networks; Training;

机译：神经元;计算机体系结构;语音识别;隐马尔可夫模型;生物神经网络;训练;

相似文献

外文文献
中文文献
专利

1. Performance Evaluation of Deep Neural Networks Applied to Speech Recognition: RNN, LSTM and GRU [J] . Apeksha Shewalkar, Deepika Nyavanandi, Simone A. Ludwig Journal of Artificial Intelligence and Soft Computing Research . 2019,第4期

机译：深度神经网络在语音识别中的性能评估：RNN，LSTM和GRU
2. Power, Performance, and Area Benefit of Monolithic 3D ICs for On-Chip Deep Neural Networks Targeting Speech Recognition [J] . Chang Kyungwook, Kadetotad Deepak, Cao Yu, ACM Journal on Emerging Technologies in Computing Systems . 2018,第4期

机译：用于芯片识别的片上深度神经网络的单片3D IC的电源，性能和面积利益
3. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J] . Yan-Hui Tu, Jun Du, Chin-Hui Lee Journal of signal processing systems for signal, image, and video technology . 2018,第7期

机译：基于说话者的基于深度神经网络的单通道联合语音分离和声学建模方法，用于多语音对话的鲁棒识别
4. Evaluation of Modified Deep Neural Network Architecture Performance for Speech Recognition [C] . Md Amaan Haque, John Sahaya Rani Alex, Nithya Venkatesan International Conference on Intelligent and Advanced Systems . 2018

机译：语音识别修改深神经网络架构性能的评价
5. Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks. [D] . Pillai, Suhas Balkrishna. 2017

机译：使用深度神经网络的表情异常语音识别和离线手写识别。
6. Multi-resolution speech analysis for automatic speech recognition using deep neural networks: Experiments on TIMIT [O] . Doroteo T. Toledano, María Pilar Fernández-Gallego, Alicia Lozano-Diez 2012

机译：基于深度神经网络的自动语音识别的多分辨率语音分析：TIMIT实验
7. Performance Evaluation of Deep Neural Networks Applied to Speech Recognition: RNN, LSTM and GRU [O] . Apeksha Shewalkar, Deepika Nyavanandi, Simone A. Ludwig 2019

机译：深度神经网络应用于语音识别的性能评估：RNN，LSTM和GRU

Evaluation of Modified Deep Neural Network Architecture Performance for Speech Recognition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅