首页> 外文会议>Annual conference of the International Speech Communication Association >Automatic Vocabulary Adaptation Based on Semantic Similarity and Speech Recognition Confidence Measure
【24h】

Automatic Vocabulary Adaptation Based on Semantic Similarity and Speech Recognition Confidence Measure

机译:基于语义相似度和语音识别置信度的词汇自动适应

获取原文

摘要

Out-Of-Vocabulary (OOV) word utterances are unavoidable in speech recognition since the vocabulary size of a recognition dictionary is limited. And therefore, automatic vocabulary adaptation, which selects unregistered (i.e. OOV) words from relevant documents and registers them to a dictionary with their proper probability values, is an important technique. To improve recognition accuracy, a vocabulary adaptation method is required to register only relevant words that will actually be spoken in target utterances and not to register words that will not be spoken (i.e. redundant word entries). In this paper, we propose a novel automatic vocabulary adaptation method that satisfies these requirements based on semantic and acoustic similarities. Acoustic similarity is represented in speech recognition confidence measure. Experiments show that, with our method, the word selection accuracy is improved twice and the recognition accuracy focused on newly registered words is improved 15.1% in F-measure, compared with conventional methods.
机译:由于识别词典的词汇量有限,因此语音识别中不可避免地出现了词汇外(OOV)话语。因此,自动词汇自适应是一项重要技术,该技术可以从相关文档中选择未注册(即OOV)单词并将其注册到词典中。为了提高识别的准确性,需要一种词汇适应方法来仅注册将以目标话语实际说出的相关单词,而不是注册将不会说出的单词(即,多余的单词条目)。在本文中,我们提出了一种新颖的自动词汇自适应方法,该方法可以基于语义和声学相似性来满足这些要求。语音相似度在语音识别置信度中表示。实验表明,与传统方法相比,我们的方法在F-measure中将单词选择准确率提高了两倍,对新注册单词的识别准确率提高了15.1%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号