【24h】

Speech Recognition Using context Conditional word Posterior Probabilities

机译:使用上下文条件词后验概率的语音识别

获取原文

摘要

In this paper two new scoring schemes for large vocabulary continuous speech recognition are compared. Instead of using the joint probability of a word sequence and a sequence of acoustic observations, we determien the best path through a word graph using posterior word probabilities with or without word context. The exact calulation of the posterior probability for a word sequence implies a sum over all possible word boundaries, which is approximated by a maximum operation in the standard scoring approach. The new scoring scheme using word posterior probabilities could be expected to lead to improved recognition performance, because it involves partial summation over word boundaries. We present experimetnal results on five differnet corpora, the Dutch Arise corpus, the German Verbmobil '98 corpus, the English North American Business '94 20k and 64k development corpora, and the English Broadcast News '96 corpus. It is shown that the Viterbi approxiamtion within words has no efect on standard and word posterior based recognition. Using word posteriro probabilities with an without word context, the relative reduction in word error rate is comparable and ranges between 1.5
机译:本文比较了两种针对大词汇量连续语音识别的新评分方案。代替使用单词序列和听觉观测序列的联合概率,我们使用带有或不带有单词上下文的后单词概率来确定通过单词图的最佳路径。单词序列的后验概率的精确计算意味着所有可能的单词边界上的总和,这可以通过标准评分方法中的最大运算来近似。使用单词后验概率的新评分方案有望提高识别性能,因为它涉及单词边界的部分求和。我们介绍了五个不同的网络语料库,荷兰Arise语料库,德国Verbmobil '98语料库,英语North America Business '94 20k和64k开发语料库以及英语广播新闻'96语料库的实验结果。结果表明,单词内的维特比近似对基于标准和单词后验的识别没有影响。使用没有单词上下文的单词后验概率,单词错误率的相对降低是可比较的,范围在1.5

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号