Conference: European Conference on Speech Communication and Technology

Estimating Speech Recognition Error Rate without Acoustic Test Data

Abstract

We address the problem of estimating the word error rate (WER) of an automatic speech recognition (ASR) system without using acoustic test data. This is an important problem faced by designers of new applications that use ASR. A quick estimate of WER early in the design cycle can guide decisions involving dialog strategy and grammar design. Our approach estimates the probability distribution of the word hypotheses produced by the underlying ASR system given the text test corpus. A critical component of this system is a phonemic confusion model, which seeks to capture the errors made by ASR on the acoustic data at a phonemic level. We use a confusion model composed of probabilistic phoneme sequence conversion rules, which are learned from phonemic transcription pairs obtained by leave-one-out decoding of the training set. We show reasonably close estimation of WER when applying the system to test sets from different domains.
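
The abstract gives no implementation details, so the following is only a toy sketch of the general idea: corrupt the phoneme sequence of a text-only test corpus with a confusion model, map the result back to words, and score it against the original text. Everything specific here is an assumption made for illustration: the four-word lexicon, the confusion probabilities, the use of independent single-phoneme substitutions in place of the learned phoneme sequence conversion rules, and the nearest-pronunciation lookup that stands in for a real recognizer with a language model.

```python
import random

def edit_distance(ref, hyp):
    """Levenshtein distance: substitutions, insertions, deletions each cost 1."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(prev[j] + 1,               # deletion
                            curr[j - 1] + 1,           # insertion
                            prev[j - 1] + (r != h)))   # substitution or match
        prev = curr
    return prev[-1]

def word_error_rate(ref_words, hyp_words):
    """WER = word-level edit distance / number of reference words."""
    return edit_distance(ref_words, hyp_words) / len(ref_words)

# Hypothetical toy lexicon (word -> phoneme sequence), not from the paper.
LEXICON = {
    "call": ("k", "ao", "l"),
    "tall": ("t", "ao", "l"),
    "home": ("hh", "ow", "m"),
    "now": ("n", "aw"),
}

# Hypothetical confusion table: P(recognized phoneme | spoken phoneme).
# Phonemes not listed are assumed to be recognized correctly.
CONFUSIONS = {
    "k": {"k": 0.8, "t": 0.2},
    "m": {"m": 0.9, "n": 0.1},
}

def simulate_hypothesis(ref_words, confusions, rng):
    """Sample one simulated recognizer output for a reference word sequence."""
    hyp = []
    for word in ref_words:
        noisy = []
        for ph in LEXICON[word]:
            dist = confusions.get(ph, {ph: 1.0})
            noisy.append(rng.choices(list(dist), weights=list(dist.values()))[0])
        # Stand-in for decoding: pick the word with the closest pronunciation.
        hyp.append(min(LEXICON, key=lambda w: edit_distance(LEXICON[w], noisy)))
    return hyp

def estimate_wer(ref_words, confusions, n_samples=1000, seed=0):
    """Average WER over sampled hypotheses; no acoustic data is involved."""
    rng = random.Random(seed)
    total = sum(
        word_error_rate(ref_words, simulate_hypothesis(ref_words, confusions, rng))
        for _ in range(n_samples)
    )
    return total / n_samples

if __name__ == "__main__":
    print(estimate_wer("call home now".split(), CONFUSIONS))
```

In this toy setup only the k/t confusion turns one lexicon word into another ("call" into "tall"), so the estimate is driven by that single entry. A realistic system would need a full pronunciation dictionary, a confusion model learned from data, and a decoder that uses the application's grammar or language model.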
