首页> 外文会议>Annual conference of the International Speech Communication Association >Cross-Lingual and Ensemble MLPs Strategies for Low-Resource Speech Recognition
【24h】

Cross-Lingual and Ensemble MLPs Strategies for Low-Resource Speech Recognition

机译:低资源语音识别的跨语言和整体MLP策略

获取原文

摘要

Recently there has been some interest in the question of how to build LVCSR systems for the low-resource languages. The scenario we focus on here is having only one hour of acoustic training data in the "target" language, but more plentiful data in other languages. This paper presents approaches using MLP based features: we construct a low-resource system with additional sources of information from the non-target languages to train the cross-lingual MLPs. A hierarchical architecture and multi-stream strategy are applied on the cross-lingual phone level, to improve the neural network more discriminatively. Additionally, an elaborate ensemble system with various acoustic feature streams and context expansion lengths is proposed. After system combination with these two strategies we get significant improvements of more than 8% absolute versus a conventional baseline in this low-resource scenario with only one hour of target training data.
机译:最近,对于如何为低资源语言构建LVCSR系统的问题引起了一些兴趣。我们在这里关注的场景是只有一个小时的“目标”语言的声学训练数据,而其他语言的数据则更多。本文介绍了使用基于MLP的功能的方法:我们构建了一个资源少的系统,其中包含来自非目标语言的其他信息源,以训练跨语言的MLP。在跨语言的电话级别上应用了分层体系结构和多流策略,以更具区分性地改进神经网络。此外,提出了一种具有各种声学特征流和上下文扩展长度的复杂合奏系统。在将这两种策略与系统结合后,在这种资源匮乏的情况下,仅使用一小时的目标训练数据,相对于传统基准,我们的绝对值显着提高了8%以上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号