首页> 外文会议>International Conference on Advances in Speech and Language Technologies for Iberian Languages >An Analysis of Deep Neural Networks in Broad Phonetic Classes for Noisy Speech Recognition
【24h】

An Analysis of Deep Neural Networks in Broad Phonetic Classes for Noisy Speech Recognition

机译:嘈杂语音识别广义语音课中深度神经网络的分析

获取原文

摘要

The introduction of Deep Neural Network (DNN) based acoustic models has produced dramatic improvements in performance. In particular, we have recently found that Deep Maxout Networks, a modification of DNNs' feed-forward architecture that uses a max-out activation function, provides enhanced robustness to environmental noise. In this paper we further investigate how these improvements are translated into the different broad phonetic classes and how does it compare to classical Hidden Markov Models (HMM) based back-ends. Our experiments demonstrate that performance is still tightly related to the particular phonetic class being stops and affricates the least resilient but also that relative improvements of both DNN variants are distributed unevenly across those classes having the type of noise a significant influence on the distribution. A combination of the different systems DNN and classical HMM is also proposed to validate our hypothesis that the traditional GMM/HMM systems have a different type of error than the Deep Neural Networks hybrid models.
机译:基于深度神经网络(DNN)的声学模型的引入产生了戏剧性的性能。特别是,我们最近发现Depe Maxout网络,DNNS的前馈架构的修改,用于使用最大化激活功能,提供增强的环境噪声的鲁棒性。在本文中,我们进一步调查了这些改进将这些改进转化为不同的广泛语音类以及它如何与基于古典隐马尔可夫模型(HMM)的后端进行比较。我们的实验表明,性能仍然与停止和递力最小弹性的特定语音类别紧密相关,但也是DNN变体的相对改进在具有对分布的显着影响的那些具有显着影响的这些类中不均匀地分布。还提出了不同系统DNN和经典HMM的组合来验证我们的假设,即传统的GMM / HMM系统具有与深神经网络混合模型不同类型的误差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号