首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >LEARNING EFFECTIVE FACTORIZED HIDDEN LAYER BASES USING STUDENT-TEACHER TRAINING FOR LSTM ACOUSTIC MODEL ADAPTATION
【24h】

LEARNING EFFECTIVE FACTORIZED HIDDEN LAYER BASES USING STUDENT-TEACHER TRAINING FOR LSTM ACOUSTIC MODEL ADAPTATION

机译:使用学生 - 教师培训学习有效分解隐藏层基础,用于LSTM声学模型适应

获取原文

摘要

Factorized Hidden Layer (FHL) has been proposed for the adaptation of deep neural network (DNN) and Long Short-Term Memory (LSTM) based acoustic models (AMs). In FHL, a speaker-dependent (SD) transformation matrix and an SD bias are included in addition to the standard affine transformation. The SD transformation is a linear combination of rank-1 matrices whereas the SD bias is a linear combination of vectors. However, the adaptation of LSTMs is challenging and often reports modest gains. In this paper, we propose to use student-teacher training to estimate more efficient FHL bases for LSTM AMs using an FHL adapted DNN as the teacher model. For both AMI IHM and AMI SDM tasks, FHL achieves 3.2% absolute improvement over the frame-level cross entropy trained LSTM baselines. Moreover, FHL results 3.0% and 3.8% absolute improvements over sequentially trained LSTM baselines for the AMI IHM and AMI SDM tasks respectively.
机译:已经提出了分解隐藏层(FHL),用于适应深度神经网络(DNN)和基于长短期存储器(LSTM)的声学模型(AMS)。在FHL中,除了标准仿射变换之外,还包括扬声器依赖性(SD)变换矩阵和SD偏压。 SD变换是秩-1矩阵的线性组合,而SD偏置是载体的线性组合。但是,LSTMS的适应性挑战,通常报告适度的收益。在本文中,我们建议使用学生教师培训来估计LSTM AMS的更高效的FHL基础,使用FHL适应了DNN作为教师模型。对于AMI IHM和AMI SDM任务,FHL在帧级交叉熵培训的LSTM基线上实现了3.2%的绝对改进。此外,FHL结果分别为AMI IHM和AMI SDM任务的顺序训练的LSTM基线的绝对改进3.0%和3.8%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号