首页> 外文OA文献 >Modeling within-word and cross-word pronunciation variation to improve the performance of a Dutch CSR.
【2h】

Modeling within-word and cross-word pronunciation variation to improve the performance of a Dutch CSR.

机译:为单词内和单词间的发音变化建模,以改善荷兰CSR的性能。

摘要

This paper describes how the performance of a continuous speech recognizer for Dutch has been improved by modeling within-word and cross-word pronunciation variation. Within-word variants were automatically generated by applying five phonological rules to the words in the lexicon. For the within-word method, a significant improvement is found compared to the baseline. Cross-word pronunciation variation was modeled using two different methods: 1) adding cross-word variants directly to the lexicon, 2) only adding multi-words and their variants to the lexicon. Overall, cross-word method 2 leads to better results than cross-word method 1. The best results were obtained when cross-word method 2 was combined with the within-word method: a relative improvement of 8.8% WER was found compared to the baseline.
机译:本文介绍了如何通过对词内和跨词发音变化进行建模来改善荷兰语连续语音识别器的性能。通过将五个音标规则应用于词典中的单词,可以自动生成单词内变体。对于字内方法,发现与基线相比有显着改进。跨字发音变体的建模使用两种不同的方法:1)将跨字变体直接添加到词典中; 2)仅将多字及其变体添加到词典中。总体而言,填字游戏方法2的结果比填字游戏方法1更好。当将填字游戏方法2与内置字方法结合使用时,可获得最佳结果:与填字游戏方法相比WER相对提高了8.8%基线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号