首页> 外文会议>International conference on Asian language processing >Research on Transfer Learning for Khalkha Mongolian Speech Recognition Based on TDNN
【24h】

Research on Transfer Learning for Khalkha Mongolian Speech Recognition Based on TDNN

机译:基于TDNN的喀尔喀语蒙古语语音识别迁移学习研究

获取原文

摘要

Automated speech recognition(ASR)incorporating Neural Networks with Hidden Markov Models (NNs/HMMs)have achieved the state-of-the-art in various benchmarks. Most of them use a large amount of training data. However, ASR research is still quite difficult in languages with limited resources, such as Khalkha Mongolian. Transfer learning methods have been shown to be effective utilizing out-of-domain data to improve ASR performance in similar data-scarce. In this paper, we investigate two different weight transfer approaches to improve the performance of Khalkha Mongolian ASR based on Lattice-free Maximum Mutual Information(LF-MMI). Moreover, the i-vector feature is used to combine with the MFCCs feature as the input to validate the effectiveness of Khalkha Mongolian ASR transfer models. Experimental results show that the weight transfer methods with out-of-domain Chahar speech can achieve great improvements over baseline model on Khalkha speech. And transferring parts of the model performs better than transferring the whole model. Furthermore, the i-vector spliced together with MFCCs as input features can further enhance the performance of the acoustic model. The WER of optimal model is relatively reduced by 10.96% compared with the in-of-domain Khalkha speech baseline model.
机译:结合了神经网络和隐马尔可夫模型(NNs / HMM)的自动语音识别(ASR)在各种基准测试中均达到了最新水平。他们大多数使用大量的训练数据。但是,对于资源有限的语言(例如Khalkha Mongolian),ASR研究仍然非常困难。传输学习方法已被证明可以有效地利用域外数据来改善类似数据稀缺情况下的ASR性能。在本文中,我们研究了两种基于无格最大互信息(LF-MMI)的不同权重传递方法,以提高Khalkha蒙古ASR的性能。此外,i矢量功能用于与MFCC功能组合作为输入,以验证Khalkha Mongolian ASR传递模型的有效性。实验结果表明,采用域外Chahar语音的权重传递方法可以比Khalkha语音的基线模型有很大的改进。转移模型的部分要比转移整个模型的性能更好。此外,与MFCC拼接在一起的i-vector作为输入特征可以进一步增强声学模型的性能。与域内Khalkha语音基准模型相比,最优模型的WER相对减少了10.96%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号