【24h】

Estimating Speech Recognition Error Rate without Acoustic Test Data

机译:在没有声学测试数据的情况下估计语音识别错误率

获取原文
获取原文并翻译 | 示例

摘要

We address the problem of estimating the word error rate (WER) of an automatic speech recognition (ASR) system without using acoustic test data. This is an important problem which is faced by the designers of new applications which use ASR. Quick estimate of WER early in the design cycle can be used to guide the decisions involving dialog strategy and grammar design. Our approach involves estimating the probability distribution of the word hypotheses produced by the underlying ASR system given the text test corpus. A critical component of this system is a phonemic confusion model which seeks to capture the errors made by ASR on the acoustic data at a phonemic level. We use a confusion model composed of probabilistic phoneme sequence conversion rules which are learned from phonemic transcription pairs obtained by leave-one-out decoding of the training set. We show reasonably close estimation of WER when applying the system to test sets from different domains.
机译:我们解决了在不使用声学测试数据的情况下估算自动语音识别(ASR)系统的单词错误率(WER)的问题。这是使用ASR的新应用程序设计人员面临的一个重要问题。在设计周期的早期对WER进行快速估算可用于指导涉及对话策略和语法设计的决策。我们的方法涉及在给定文本测试语料库的情况下,估计由潜在ASR系统产生的单词假设的概率分布。该系统的关键组成部分是音素混淆模型,该模型试图以音素水平捕获ASR对声学数据造成的错误。我们使用由概率音素序列转换规则组成的混淆模型,这些规则是从通过训练集的留一法解码获得的音素转录对中学习的。当将系统应用于来自不同领域的测试集时,我们显示了合理的WER估计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号