LEARNING EFFECTIVE FACTORIZED HIDDEN LAYER BASES USING STUDENT-TEACHER TRAINING FOR LSTM ACOUSTIC MODEL ADAPTATION

机译：使用学生 - 教师培训学习有效分解隐藏层基础，用于LSTM声学模型适应

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Factorized Hidden Layer (FHL) has been proposed for the adaptation of deep neural network (DNN) and Long Short-Term Memory (LSTM) based acoustic models (AMs). In FHL, a speaker-dependent (SD) transformation matrix and an SD bias are included in addition to the standard affine transformation. The SD transformation is a linear combination of rank-1 matrices whereas the SD bias is a linear combination of vectors. However, the adaptation of LSTMs is challenging and often reports modest gains. In this paper, we propose to use student-teacher training to estimate more efficient FHL bases for LSTM AMs using an FHL adapted DNN as the teacher model. For both AMI IHM and AMI SDM tasks, FHL achieves 3.2% absolute improvement over the frame-level cross entropy trained LSTM baselines. Moreover, FHL results 3.0% and 3.8% absolute improvements over sequentially trained LSTM baselines for the AMI IHM and AMI SDM tasks respectively.

机译：已经提出了分解隐藏层（FHL），用于适应深度神经网络（DNN）和基于长短期存储器（LSTM）的声学模型（AMS）。在FHL中，除了标准仿射变换之外，还包括扬声器依赖性（SD）变换矩阵和SD偏压。 SD变换是秩-1矩阵的线性组合，而SD偏置是载体的线性组合。但是，LSTMS的适应性挑战，通常报告适度的收益。在本文中，我们建议使用学生教师培训来估计LSTM AMS的更高效的FHL基础，使用FHL适应了DNN作为教师模型。对于AMI IHM和AMI SDM任务，FHL在帧级交叉熵培训的LSTM基线上实现了3.2％的绝对改进。此外，FHL结果分别为AMI IHM和AMI SDM任务的顺序训练的LSTM基线的绝对改进3.0％和3.8％。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2018年|5739-6377p|共5页
会议地点
作者
Lahiru Samarakoon; Brian Mak; Khe Chai Sim;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词
Long Short-Term memory (LSTM); Recurrent Neural Networks (RNNs); Speaker Adaptation; Student-teacher training; Acoustic Modeling;

机译：长短期记忆（LSTM）;经常性神经网络（RNN）;扬声器适应;学生教师培训;声学建模;

相似文献

外文文献
中文文献
专利

1. Factorized Hidden Layer Adaptation for Deep Neural Network Based Acoustic Modeling [J] . Lahiru Samarakoon, Khe Chai Sim Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第12期

机译：基于深度神经网络的声学建模的分解隐藏层自适应
2. Incremental Multiple Hidden Layers Regularized Extreme Learning Machine Based on Forced Positive-Definite Cholesky Factorization [J] . Jingyi Liu, Ba Tuan Le Mathematical Problems in Engineering: Theory, Methods and Applications . 2019,第1期

机译：基于强制正定的巧克力分解的增量多个隐藏图层正规化的极端学习机
3. Online Learning and Acoustic Feature Adaptation in Large Margin Hidden Markov Models [J] . Cheng C.-C., Sha F., Saul L. K. Selected Topics in Signal Processing, IEEE Journal of . 2010,第99期

机译：大余量隐马尔可夫模型中的在线学习和声学特征自适应
4. LEARNING EFFECTIVE FACTORIZED HIDDEN LAYER BASES USING STUDENT-TEACHER TRAINING FOR LSTM ACOUSTIC MODEL ADAPTATION [C] . Lahiru Samarakoon, Brian Mak, Khe Chai Sim IEEE International Conference on Acoustics, Speech and Signal Processing . 2018

机译：使用学生 - 教师培训学习有效分解隐藏层基础，用于LSTM声学模型适应
5. Hidden Markov Model based animal acoustic censusing: Learning from speech processing technology [D] . Adi, C. Kuntoro 2008

机译：基于隐马尔可夫模型的动物声学统计：从语音处理技术中学习
6. Sensor Drift Compensation Based on the Improved LSTM and SVM Multi-Class Ensemble Learning Models [O] . Xia Zhao, Pengfei Li, Kaitai Xiao, 2019

机译：基于改进的LSTM和SVM多类集成学习模型的传感器漂移补偿
7. Utterance-based Selective Training for Cost-Effective Task-Adaptation of Acoustic Models [O] . Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, 2006

机译：基于言语的选择性训练，用于声学模型的成本有效的任务自适应

LEARNING EFFECTIVE FACTORIZED HIDDEN LAYER BASES USING STUDENT-TEACHER TRAINING FOR LSTM ACOUSTIC MODEL ADAPTATION

摘要

著录项

相似文献

相关主题

期刊订阅