首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Joint acoustic factor learning for robust deep neural network based automatic speech recognition
【24h】

Joint acoustic factor learning for robust deep neural network based automatic speech recognition

机译:基于鲁棒深度神经网络的自动语音识别联合声学因子学习

获取原文

摘要

Deep neural networks (DNNs) for acoustic modeling have been shown to provide impressive results on many state-of-the-art automatic speech recognition (ASR) applications. However, DNN performance degrades due to mismatches in training and testing conditions and thus adaptation is necessary. In this paper, we explore the use of discriminative auxiliary input features obtained using joint acoustic factor learning for DNN adaptation. These features are derived from a bottleneck (BN) layer of a DNN and are referred to as BN vectors. To derive these BN vectors, we explore the use of two types of joint acoustic factor learning which capture speaker and auxiliary information such as noise, phone and articulatory information of speech. In this paper, we show that these BN vectors can be used for adaptation and thereby improve the performance of an ASR system. We also show that the performance can be further improved on augmenting these BN vectors to conventional i-vectors. In this paper, experiments are performed on Aurora-4, REVERB challenge and AMI databases.
机译:已经证明,用于声学建模的深度神经网络(DNN)可在许多最新的自动语音识别(ASR)应用中提供令人印象深刻的结果。但是,由于训练和测试条件的不匹配,DNN性能会下降,因此需要进行调整。在本文中,我们探索使用通过联合声学因子学习获得的可判别辅助输入特征进行DNN自适应。这些特征源自DNN的瓶颈(BN)层,被称为BN向量。为了导出这些BN向量,我们探索了两种类型的联合声学因子学习的使用,它们可以捕获说话者和辅助信息,例如噪声,电话和语音的发音信息。在本文中,我们表明这些BN向量可用于自适应,从而提高ASR系统的性能。我们还表明,在将这些BN向量扩展为常规i向量时,可以进一步提高性能。在本文中,对Aurora-4,REVERB挑战和AMI数据库进行了实验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号