Detection of OOV Words Using Generalized Word Models and a Semantic Class Language Model

Abstract

This paper describes an approach to detecting out-of-vocabulary (OOV) words in spontaneous speech using a language model built on semantic categories and a new type of generalized word model consisting of a mixture of specific and general acoustic units. We demonstrate the construction of generalized word models as replacements for surnames in GSST, a German spontaneous travel planning task. We show that using our generalized word models improves recognition accuracy in cases where OOV words appear and does not degrade overall recognition accuracy. In our experiments, we measured recall and precision rates of OOV detection that are close to their theoretical optimum. Furthermore, we compared the effect of using cross-word triphones with that of using context-independent cross-word models. We show that when using generalized word models with cross-word triphones, the expected number of consequential errors following an OOV word can be reduced significantly, by 37%.
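The abstract reports recall and precision of OOV detection but does not say how they were scored. As a minimal sketch only, the following assumes that reference and hypothesis words have already been aligned one-to-one and that each position carries a boolean "is OOV" flag. The names `DetectionCounts`, `count_detections`, `recall`, and `precision` are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class DetectionCounts:
    true_positives: int   # OOV words correctly flagged as OOV
    false_positives: int  # in-vocabulary words wrongly flagged as OOV
    false_negatives: int  # OOV words that were not flagged


def count_detections(reference_is_oov: Sequence[bool],
                     hypothesis_is_oov: Sequence[bool]) -> DetectionCounts:
    """Count hits, false alarms, and misses over aligned word positions.

    Assumption: both sequences are already aligned position by position.
    """
    if len(reference_is_oov) != len(hypothesis_is_oov):
        raise ValueError("reference and hypothesis must have equal length")
    tp = fp = fn = 0
    for ref, hyp in zip(reference_is_oov, hypothesis_is_oov):
        if ref and hyp:
            tp += 1
        elif hyp:
            fp += 1
        elif ref:
            fn += 1
    return DetectionCounts(tp, fp, fn)


def recall(counts: DetectionCounts) -> float:
    """Fraction of true OOV words that were detected."""
    total = counts.true_positives + counts.false_negatives
    return counts.true_positives / total if total else 0.0


def precision(counts: DetectionCounts) -> float:
    """Fraction of OOV detections that were actually OOV words."""
    total = counts.true_positives + counts.false_positives
    return counts.true_positives / total if total else 0.0


if __name__ == "__main__":
    # Toy data for illustration only, not results from the paper.
    reference = [False, True, False, False, True, False, True]
    hypothesis = [False, True, False, True, True, False, False]
    counts = count_detections(reference, hypothesis)
    print(counts, recall(counts), precision(counts))
```

Under this convention, recall penalizes missed OOV words and precision penalizes false alarms on in-vocabulary words, so the two need to be read together when judging a detector.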

Bibliographic record

Source: European Conference on Speech Communication and Technology, 2001, 4 pages
Author: Thomas Schaaf
Original format: PDF
Chinese Library Classification: communication theory

Similar literature

1. Handling OOV Words in Mandarin Spoken Term Detection with an Hierarchical n-Gram Language Model [J]. WANG Xuyang, ZHANG Pengyuan, NA Xingyu. Chinese Journal of Electronics, 2017, No. 6.
2. Modelling Semantic Context of OOV Words in Large Vocabulary Continuous Speech Recognition [J]. Imran Sheikh, Dominique Fohr, Irina Illina. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2017, No. 3.
3. Word Embedding Models for Finding Semantic Relationship between Words in Tamil Language [J]. S. G. Ajay, M. Srikanth, M. Anand Kumar. Indian Journal of Science and Technology, 2016, No. 45.
4. Detection of OOV Words Using Generalized Word Models and a Semantic Class Language Model [C]. Thomas Schaaf. European Conference on Speech Communication and Technology, 2001.
5. Connecting Documents, Words, and Languages Using Topic Models [D]. Yang, Weiwei. 2019.
6. A Study of Reverse-Worded Matched Item Pairs Using the Generalized Partial Credit and Nominal Response Models [O]. Ki Lynn Matlock, Ronna C. Turner, W. Dent Gitchel. 2018.
7. Modelling Semantic Context of OOV Words in Large Vocabulary Continuous Speech Recognition [O]. Imran Sheikh, Dominique Fohr, Irina Illina. 2017.
8. From Word-Spotting to OOV Modeling [R]. Fitzpatrick, P. 2001.
