首页> 外文会议>Association for Computational Linguistics Annual Meeting >Speech Recognition of Czech - Inclusion of Rare Words Helps
【24h】

Speech Recognition of Czech - Inclusion of Rare Words Helps

机译:捷克语的语音识别 - 包含稀有语言有所帮助

获取原文

摘要

Large vocabulary continuous speech recognition of inflective languages, such as Czech, Russian or Serbo-Croatian, is heavily deteriorated by excessive out of vocabulary rate. In this paper, we tackle the problem of vocabulary selection, language modeling and pruning for inflective languages. We show that by explicit reduction of out of vocabulary rate we can achieve significant improvements in recognition accuracy while almost preserving the model size. Reported results are on Czech speech corpora.
机译:大量词汇连续语音识别捷克语,俄语或塞尔博罗地亚语,如捷克语,俄罗斯或塞尔克罗地亚语,因词汇量过多而严重恶化。在本文中,我们解决了转向语言的词汇选择,语言建模和修剪问题。我们表明,通过显着减少词汇率,我们可以实现识别准确性的显着改进,同时几乎保持模型大小。据报道的结果是在捷克语演讲基础上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号