首页> 外文会议>European conference on speech communication and technology >Detection of OOV Words Using Generalized Word Models and a Semantic Class Language Model
【24h】

Detection of OOV Words Using Generalized Word Models and a Semantic Class Language Model

机译:使用广义单词模型和语义类语言模型检测OOV单词

获取原文

摘要

This paper describes an approach to detect out-of-vocabulary words in spontaneous speech using a language model built on semantic categories and a new type of generalized word models consisting of a mixture of specific and general acoustic units. We demonstrate the construction of the generalized word models as replacements for surnames in a German spontaneous travel planning task GSST. We show that the use of our generalized word models improves recognition accuracy in cases where out-of-vocabulary words appear and does not lead to a degradation of the overall recognition accuracy. In our experiments we measured recall and precision rates of OOV-detection which are close to their theoretic optimum. Furthermore, we compared the effect of using cross-word-triphones vs. using context-independent cross-word models. We show that when using generalized word models with cross-word-triphones, the expected number of consequential errors following an OOV word can be reduced significantly by 37%.
机译:本文介绍了一种使用基于语义类别的语言模型和由特定和一般声学单元的混合物组成的语言模型来检测自发语言中的语言模型的方法。我们展示了德国自发旅行计划任务GSST中姓氏的替代品的替代品。我们表明,在词汇外单词出现的情况下,使用我们的广义文字模型的使用提高了识别准确性,并且不会导致整体识别准确性的降低。在我们的实验中,我们测量了OOV检测的召回和精确率,其接近其理论最佳。此外,我们比较了使用跨字三倍频与使用上下文相关的跨词模型的效果。我们表明,使用具有跨字三相色调的通用单词模型时,OOV字后面的后续误差数可以明显减少37%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号