Journal: IEEE Transactions on Audio, Speech and Language Processing

Analysis of the errors produced by the 2004 BBN speech recognition system in the DARPA EARS evaluations

Abstract

This paper aims to quantify the main error types the 2004 BBN speech recognition system made in the broadcast news (BN) and conversational telephone speech (CTS) DARPA EARS evaluations. We show that many of the remaining errors occur in clusters rather than in isolation, have specific causes, and differ to some extent between the BN and CTS domains. Correctly recognized words are also clustered and are highly correlated with regions where the system produces a single hypothesized choice per word. A statistical analysis of some well-known error causes (out-of-vocabulary words, word fragments, hesitations, and unlikely language constructs) was performed to assess their contribution to the overall word error rate (WER). We conclude with a discussion of the lower bound on the WER introduced by human annotator disagreement.
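
For readers unfamiliar with the metric, WER is conventionally defined as the number of word substitutions, deletions, and insertions in a minimum-edit alignment of the hypothesis against the reference, divided by the number of reference words. The following is a minimal sketch of that definition only. It is not the scoring tool used in the EARS evaluations, the function name `word_error_rate` is hypothetical, and it assumes plain whitespace tokenization with no text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Illustrative WER: word-level edit distance / reference length.

    Assumption: words are split on whitespace and compared exactly
    (no case folding, no handling of fragments or hesitations).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the minimum number of edits needed to turn the
    # reference words processed so far into the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One deleted word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.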