首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Context adaptive deep neural networks for fast acoustic model adaptation
【24h】

Context adaptive deep neural networks for fast acoustic model adaptation

机译:用于快速声学模型自适应的上下文自适应深度神经网络

获取原文

摘要

Deep neural networks (DNNs) are widely used for acoustic modeling in automatic speech recognition (ASR), since they greatly outperform legacy Gaussian mixture model-based systems. However, the levels of performance achieved by current DNN-based systems remain far too low in many tasks, e.g. when the training and testing acoustic contexts differ due to ambient noise, reverberation or speaker variability. Consequently, research on DNN adaptation has recently attracted much interest. In this paper, we present a novel approach for the fast adaptation of a DNN-based acoustic model to the acoustic context. We introduce a context adaptive DNN with one or several layers depending on external factors that represent the acoustic conditions. This is realized by introducing a factorized layer that uses a different set of parameters to process each class of factors. The output of the factorized layer is then obtained by weighted averaging over the contribution of the different factor classes, given posteriors over the factor classes. This paper introduces the concept of context adaptive DNN and describes preliminary experiments with the TIMIT phoneme recognition task showing consistent improvement with the proposed approach.
机译:深度神经网络(DNN)大大优于传统的基于高斯混合模型的系统,因此广泛用于自动语音识别(ASR)中的声学建模。但是,在许多任务中,例如,基于DNN的系统,当前基于DNN的系统所达到的性能水平仍然太低。当训练和测试声学环境由于环境噪声,混响或说话者变化而有所不同时。因此,有关DNN适应性的研究近来引起了人们的极大兴趣。在本文中,我们提出了一种新颖的方法,用于将基于DNN的声学模型快速适应声学环境。我们介绍了一种上下文自适应DNN,它取决于表示声学条件的外部因素,具有一层或多层。这是通过引入分解层来实现的,该层使用不同的参数集来处理每一类因子。然后,给定因子类的后代,可以通过对不同因子类的贡献进行加权平均来获得因子分解层的输出。本文介绍了上下文自适应DNN的概念,并描述了TIMIT音素识别任务的初步实验,显示了与所提出方法的一致改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号