首页> 外文会议>International Conference on speech and computer >Adaptation of DNN Acoustic Models Using KL-divergence Regularization and Multi-task Training
【24h】

Adaptation of DNN Acoustic Models Using KL-divergence Regularization and Multi-task Training

机译:使用KL-散度正则化和多任务训练适应DNN声学模型

获取原文

摘要

The adaptation of context-dependent deep neural network acoustic models is particularly challenging, because most of the context-dependent targets will have no occurrences in a small adaptation data set. Recently, a multi-task training technique has been proposed that trains the network with context-dependent and context-independent targets in parallel. This network structure offers a straightforward way for network adaptation by training only the context-independent part during the adaptation process. Here, we combine this simple adaptation technique with the KL-divergence regularization method also proposed recently. Employing multi-task training we attain a relative word error rate reduction of about 3 % on a broadcast news recognition task. Then, by using the combined adaptation technique we report a further error rate reduction of 2 % to 5 %, depending on the duration of the adaptation data, which ranged from 20 to 100 s.
机译:依赖于上下文的深度神经网络声学模型的适应性尤其具有挑战性,因为大多数依赖于上下文的目标在较小的适应性数据集中都不会出现。最近,已经提出了一种多任务训练技术,该技术并行训练具有上下文相关和上下文独立目标的网络。通过在适应过程中仅训练与上下文无关的部分,此网络结构为网络适应提供了一种直接的方法。在这里,我们将这种简单的自适应技术与最近也提出的KL-散度正则化方法相结合。通过多任务培训,我们在广播新闻识别任务上的相对单词错误率降低了约3%。然后,通过使用组合的自适应技术,我们报告了进一步的错误率降低了2%到5%,具体取决于适应数据的持续时间,范围从20到100 s。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号