首页> 外文会议>European Conference on Speech Communication and Technology v.4; 20010903-20010907; Aalborg; DK >Detection of OOV Words Using Generalized Word Models and a Semantic Class Language Model
【24h】

Detection of OOV Words Using Generalized Word Models and a Semantic Class Language Model

机译:使用广义词模型和语义类语言模型的OOV词检测

获取原文
获取原文并翻译 | 示例

摘要

This paper describes an approach to detect out-of-vocabulary words in spontaneous speech using a language model built on semantic categories and a new type of generalized word models consisting of a mixture of specific and general acoustic units. We demonstrate the construction of the generalized word models as replacements for surnames in a German spontaneous travel planning task GSST. We show that the use of our generalized word models improves recognition accuracy in cases where out-of-vocabulary words appear and does not lead to a degradation of the overall recognition accuracy. In our experiments we measured recall and precision rates of OOV-detection which are close to their theoretic optimum. Furthermore, we compared the effect of using cross-word-triphones vs. using context-independent cross-word models. We show that when using generalized word models with cross-word-triphones, the expected number of consequential errors following an OOV word can be reduced significantly by 37%.
机译:本文介绍了一种方法,该方法使用基于语义类别的语言模型和一种由特定和通用声学单元组成的新型广义词模型来检测自发语音中的词汇外词。我们在德国自发旅行计划任务GSST中演示了通用词模型的构造,以代替姓氏。我们表明,在出现词汇外单词的情况下,使用广义单词模型可以提高识别准确性,并且不会导致整体识别准确性的下降。在我们的实验中,我们测量了OOV检测的查全率和准确率,它们接近理论上的最佳值。此外,我们比较了使用交叉词三音组与使用上下文无关的交叉词模型的效果。我们表明,当使用带有跨字三音词的广义字模型时,遵循OOV字的预期结果错误数可以显着减少37%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号