Context adaptive deep neural networks for fast acoustic model adaptation

机译：用于快速声学模型自适应的上下文自适应深度神经网络

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Deep neural networks (DNNs) are widely used for acoustic modeling in automatic speech recognition (ASR), since they greatly outperform legacy Gaussian mixture model-based systems. However, the levels of performance achieved by current DNN-based systems remain far too low in many tasks, e.g. when the training and testing acoustic contexts differ due to ambient noise, reverberation or speaker variability. Consequently, research on DNN adaptation has recently attracted much interest. In this paper, we present a novel approach for the fast adaptation of a DNN-based acoustic model to the acoustic context. We introduce a context adaptive DNN with one or several layers depending on external factors that represent the acoustic conditions. This is realized by introducing a factorized layer that uses a different set of parameters to process each class of factors. The output of the factorized layer is then obtained by weighted averaging over the contribution of the different factor classes, given posteriors over the factor classes. This paper introduces the concept of context adaptive DNN and describes preliminary experiments with the TIMIT phoneme recognition task showing consistent improvement with the proposed approach.

机译：深度神经网络（DNN）大大优于传统的基于高斯混合模型的系统，因此广泛用于自动语音识别（ASR）中的声学建模。但是，在许多任务中，例如，基于DNN的系统，当前基于DNN的系统所达到的性能水平仍然太低。当训练和测试声学环境由于环境噪声，混响或说话者变化而有所不同时。因此，有关DNN适应性的研究近来引起了人们的极大兴趣。在本文中，我们提出了一种新颖的方法，用于将基于DNN的声学模型快速适应声学环境。我们介绍了一种上下文自适应DNN，它取决于表示声学条件的外部因素，具有一层或多层。这是通过引入分解层来实现的，该层使用不同的参数集来处理每一类因子。然后，给定因子类的后代，可以通过对不同因子类的贡献进行加权平均来获得因子分解层的输出。本文介绍了上下文自适应DNN的概念，并描述了TIMIT音素识别任务的初步实验，显示了与所提出方法的一致改进。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2015年|4535-4539|共5页
会议地点
作者
Delcroix Marc; Kinoshita Keisuke; Hori Takaaki; Nakatani Tomohiro;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Acoustic model adaptation; Automatic speech recognition; Context adaptive DNN; Deep neural networks; Factorized DNN;

机译：声学模型自适应;语音自动识别;上下文自适应DNN;深度神经网络;特征化DNN;

相似文献

外文文献
中文文献
专利

1. Acoustic model training based on node-wise weight boundary model for fast and small-footprint deep neural networks [J] . Takeda Ryu, Nakadai Kazuhiro, Komatani Kazunori Computer speech and language . 2017,第nova期

机译：基于节点权重边界模型的快速和小足迹深度神经网络的声学模型训练
2. Factorized Hidden Layer Adaptation for Deep Neural Network Based Acoustic Modeling [J] . Lahiru Samarakoon, Khe Chai Sim Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第12期

机译：基于深度神经网络的声学建模的分解隐藏层自适应
3. Multitask Learning of Context-Dependent Targets in Deep Neural Network Acoustic Models [J] . Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2017,第2期

机译：深度神经网络声学模型中上下文相关目标的多任务学习
4. Context adaptive deep neural networks for fast acoustic model adaptation in noisy conditions [C] . Marc Delcroix, Keisuke Kinoshita, Chengzhu Yu, IEEE International Conference on Acoustics, Speech and Signal Processing . 2016

机译：上下文自适应深度神经网络可在嘈杂条件下快速实现声学模型自适应
5. Deep Neural Network acoustic models for ASR. [D] . Mohamed, Abdel-rahman. 2014

机译：适用于ASR的深度神经网络声学模型。
6. Improving Robustness of Deep Neural Network Acoustic Models via Speech Separation and Joint Adaptive Training [O] . Arun Narayanan, DeLiang Wang -1

机译：通过语音分离和联合自适应训练提高深度神经网络声学模型的鲁棒性
7. Multi-task deep neural network acoustic models with model adaptation using discriminative speaker identity for whisper recognition [O] . Li Jingjie, McLoughlin Ian Vince, Liu Cong, 2016

机译：具有判别性说话人身份的模型自适应的多任务深度神经网络声学模型用于耳语识别

Context adaptive deep neural networks for fast acoustic model adaptation

摘要

著录项

相似文献

相关主题

期刊订阅