首页> 外文会议>International conference on computational linguistics >Code-switch Language Model with Inversion Constraints for Mixed Language Speech Recognition
【24h】

Code-switch Language Model with Inversion Constraints for Mixed Language Speech Recognition

机译:具有反转约束的混合语言语音识别码转换语言模型

获取原文

摘要

We propose a first ever code-switch language model for mixed language speech recognition that incorporates syntactic constraints by a code-switch boundary prediction model, a code-switch translation model, and a reconstruction model. A WFST-based decoder then recognizes speech by combining an acoustic model, a pronunciation model and the code-switch language model in an integrated approach. Our proposed approach avoids making early decisions on code-switch boundaries and is therefore more robust than previous approaches. Our proposed system using the code-switch language model outperforms a baseline of interpolated language models by a statistically significant 0.91% on a mixed language lecture speech corpus, and 1.25% on a mixed language lunch conversation corpus. Our method also outperforms a language model that permits code-switch at all word boundaries by a statistically significant 1.35% on the lecture speech corpus and 1.69% on the lunch conversation corpus.
机译:我们提出了第一个用于混合语言语音识别的代码转换语言模型,该模型通过代码转换边界预测模型,代码转换翻译模型和重构模型结合了句法约束。然后,基于WFST的解码器通过以集成方法组合声学模型,发音模型和代码转换语言模型来识别语音。我们提出的方法避免了在代码切换边界上做出早期决策,因此比以前的方法更可靠。我们提出的使用代码切换语言模型的系统在混合语言讲演语料库上比插值语言模型的基准要好,具有统计学上的显着性0.91%,在混合语言午餐会话语料库上有1.25%的统计显着性。我们的方法还优于语言模型,后者允许在所有单词边界上进行代码转换,其演讲语音语料库的统计显着性为1.35%,午餐会话语料库的统计显着性为1.69%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号