首页> 外文会议> >Error corrective mechanisms for speech recognition
【24h】

Error corrective mechanisms for speech recognition

机译:语音识别的纠错机制

获取原文

摘要

In the standard MAP approach to speech recognition, the goal is to find the word sequence with the highest posterior probability given the acoustic observation. A number of alternate approaches have been proposed for directly optimizing the word error rate, the most commonly used evaluation criterion. One of them, the consensus decoding approach, converts a word lattice into a confusion network which specifies the word-level confusions at different time intervals, and outputs the word with the highest posterior probability from each word confusion set. The paper presents a method for discriminating between the correct and alternate hypotheses in a confusion set using additional knowledge sources extracted from the confusion networks. We use transformation-based learning for inducing a set of rules to guide a better decision between the top two candidates with the highest posterior probabilities in each confusion set. The choice of this learning method is motivated by the perspicuous representation of the rules induced, which can provide insight into the cause of the errors of a speech recognizer. In experiments on the Switchboard corpus, we show significant improvements over the consensus decoding approach.
机译:在用于语音识别的标准MAP方法中,目标是在进行声学观察的情况下找到具有最高后验概率的单词序列。已经提出了许多替代方法来直接最优化最常用的评估标准字错误率。其中之一是共识解码方法,将词格转换为混淆网络,该网络指定了不同时间间隔的词级混淆,并从每个词混淆集中输出具有最高后验概率的词。本文提出了一种方法,该方法使用从混乱网络中提取的其他知识源来区分混乱集中的正确假设和替代假设。我们使用基于变换的学习来引入一组规则,以指导在每个混淆集中具有最高后验概率的前两个候选者之间做出更好的决策。这种学习方法的选择是由规则的清晰表示所激发的,这些规则可以提供对语音识别器错误原因的洞察力。在Switchboard语料库上的实验中,我们显示出对共识解码方法的显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号