An Analysis of Deep Neural Networks in Broad Phonetic Classes for Noisy Speech Recognition

机译：嘈杂语音识别广义语音课中深度神经网络的分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The introduction of Deep Neural Network (DNN) based acoustic models has produced dramatic improvements in performance. In particular, we have recently found that Deep Maxout Networks, a modification of DNNs' feed-forward architecture that uses a max-out activation function, provides enhanced robustness to environmental noise. In this paper we further investigate how these improvements are translated into the different broad phonetic classes and how does it compare to classical Hidden Markov Models (HMM) based back-ends. Our experiments demonstrate that performance is still tightly related to the particular phonetic class being stops and affricates the least resilient but also that relative improvements of both DNN variants are distributed unevenly across those classes having the type of noise a significant influence on the distribution. A combination of the different systems DNN and classical HMM is also proposed to validate our hypothesis that the traditional GMM/HMM systems have a different type of error than the Deep Neural Networks hybrid models.

机译：基于深度神经网络（DNN）的声学模型的引入产生了戏剧性的性能。特别是，我们最近发现Depe Maxout网络，DNNS的前馈架构的修改，用于使用最大化激活功能，提供增强的环境噪声的鲁棒性。在本文中，我们进一步调查了这些改进将这些改进转化为不同的广泛语音类以及它如何与基于古典隐马尔可夫模型（HMM）的后端进行比较。我们的实验表明，性能仍然与停止和递力最小弹性的特定语音类别紧密相关，但也是DNN变体的相对改进在具有对分布的显着影响的那些具有显着影响的这些类中不均匀地分布。还提出了不同系统DNN和经典HMM的组合来验证我们的假设，即传统的GMM / HMM系统具有与深神经网络混合模型不同类型的误差。

著录项

来源
《International Conference on Advances in Speech and Language Technologies for Iberian Languages》|2016年|288p|共10页
会议地点
作者
F. de-la-Calle-Silos; A. Gallardo-Antolin; C. Pelaez-Moreno;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词
Noise robustness; Deep Neural Networks; Dropout; Deep Maxout Networks; Speech recognition; Deep learning;

机译：噪声鲁棒性;深度神经网络;辍学;深度颤音网络;语音识别;深入学习;

相似文献

外文文献
中文文献
专利

1. Noisy training for deep neural networks in speech recognition [J] . Shi Yin, Chao Liu, Zhiyong Zhang, EURASIP journal on audio, speech, and music processing . 2015,第1期

机译：用于语音识别的深度神经网络的噪声训练
2. Hierarchical Singleton-Type Recurrent Neural Fuzzy Networks for Noisy Speech Recognition [J] . Juang C.-F., Chiou C.-T., Lai C.-L. IEEE Transactions on Neural Networks . 2007,第3期

机译：分层单例类型递归神经模糊网络用于嘈杂语音识别
3. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J] . Yan-Hui Tu, Jun Du, Chin-Hui Lee Journal of signal processing systems for signal, image, and video technology . 2018,第7期

机译：基于说话者的基于深度神经网络的单通道联合语音分离和声学建模方法，用于多语音对话的鲁棒识别
4. An Analysis of Deep Neural Networks in Broad Phonetic Classes for Noisy Speech Recognition [C] . F. de-la-Calle-Silos, A. Gallardo-Antolin, C. Pelaez-Moreno International conference on advances in speech and language technologies for Iberian languages . 2016

机译：用于语音识别的广泛语音分类中的深度神经网络分析
5. Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks. [D] . Pillai, Suhas Balkrishna. 2017

机译：使用深度神经网络的表情异常语音识别和离线手写识别。
6. Multi-resolution speech analysis for automatic speech recognition using deep neural networks: Experiments on TIMIT [O] . Doroteo T. Toledano, María Pilar Fernández-Gallego, Alicia Lozano-Diez 2012

机译：基于深度神经网络的自动语音识别的多分辨率语音分析：TIMIT实验
7. Noisy training for deep neural networks in speech recognition [O] . 2015

机译：语音识别中深度神经网络的噪声训练

An Analysis of Deep Neural Networks in Broad Phonetic Classes for Noisy Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅