Annual Conference of the International Speech Communication Association (INTERSPEECH 2010)

Prosodic Word-Based Error Correction in Speech Recognition Using Prosodic Word Expansion and Contextual Information


Abstract

In this study, to account for the effect of phrase grouping in spontaneous speech, prosodic words, rather than lexical words, are adopted as the units for correcting errors in speech recognition results. The prosodic words and their corresponding misrecognized word fragments are extracted from a speech database to construct a misrecognized word fragment table. For each word fragment in a recognized word sequence, the prosodic words that are likely to be misrecognized as that fragment are retrieved from the table for prosodic word candidate expansion. Prosodic word-based contextual information, which considers substitution and concatenation scores, is then incorporated into a probabilistic model to find the best word fragment sequence as the corrected output. Experimental results show that the proposed method achieved an F1 score of 0.32, an improvement of 0.18 and 0.10 over the SMT-based and lexical word-based approaches, respectively.
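
The abstract does not give the exact form of the probabilistic model, so the following is only a minimal sketch of the general idea: expand each recognized fragment into candidate prosodic words from a lookup table, then pick the best sequence with a Viterbi-style search over substitution and concatenation scores. The table contents, the probabilities, the constants `KEEP_PROB` and `UNSEEN_PROB`, and the function names are all hypothetical, and the sketch assumes one candidate per fragment, whereas the paper's prosodic words may span several fragments.

```python
import math

# Hypothetical toy table: recognized fragment -> [(prosodic word, substitution probability)].
FRAGMENT_TABLE = {
    "wreck a": [("recognize", 0.6)],
    "beach": [("speech", 0.7)],
}

# Hypothetical concatenation probabilities between adjacent candidates.
CONCAT = {
    ("<s>", "recognize"): 0.5,
    ("recognize", "speech"): 0.8,
    ("<s>", "wreck a"): 0.1,
    ("wreck a", "beach"): 0.2,
}

KEEP_PROB = 0.3     # assumed score for leaving a fragment unchanged
UNSEEN_PROB = 0.01  # assumed floor for candidate pairs missing from CONCAT


def expand(fragment):
    """Return the fragment itself plus any prosodic words listed for it in the table."""
    return [(fragment, KEEP_PROB)] + FRAGMENT_TABLE.get(fragment, [])


def correct(fragments):
    """Viterbi-style search for the highest-scoring candidate sequence (log domain)."""
    # best maps the last chosen word to (log score, path so far).
    best = {"<s>": (0.0, [])}
    for fragment in fragments:
        nxt = {}
        for word, sub_p in expand(fragment):
            for prev, (score, path) in best.items():
                cat_p = CONCAT.get((prev, word), UNSEEN_PROB)
                total = score + math.log(sub_p) + math.log(cat_p)
                if word not in nxt or total > nxt[word][0]:
                    nxt[word] = (total, path + [word])
        best = nxt
    return max(best.values(), key=lambda item: item[0])[1]


corrected = correct(["wreck a", "beach"])
```

With these made-up scores, the corrected path is preferred because its combined substitution and concatenation score outweighs the score for keeping the recognized fragments unchanged; fragments with no table entry simply pass through.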
