Estimating Speech Recognition Error Rate without Acoustic Test Data


Abstract

We address the problem of estimating the word error rate (WER) of an automatic speech recognition (ASR) system without using acoustic test data. This is an important problem faced by designers of new applications that use ASR. A quick estimate of WER early in the design cycle can guide decisions about dialog strategy and grammar design. Our approach estimates the probability distribution of the word hypotheses produced by the underlying ASR system given the text test corpus. A critical component of this system is a phonemic confusion model, which seeks to capture the errors made by ASR on the acoustic data at the phonemic level. We use a confusion model composed of probabilistic phoneme sequence conversion rules, which are learned from phonemic transcription pairs obtained by leave-one-out decoding of the training set. We show reasonably close estimates of WER when applying the system to test sets from different domains.
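
As a rough illustration of the quantities the abstract refers to, the following Python sketch computes WER as a word-level edit distance normalized by reference length, and builds a toy phoneme confusion table from transcription pairs. This is not the authors' method: the paper learns probabilistic phoneme sequence conversion rules, while this sketch assumes the simpler case of one-to-one phoneme substitutions over equal-length pairs. All function names (`edit_distance`, `wer`, `learn_confusions`, `simulate`) are hypothetical and chosen for this example only.

```python
import random
from collections import Counter, defaultdict


def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(ref_words, hyp_words):
    """Word error rate: edit distance divided by reference length."""
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hyp_words) / len(ref_words)


def learn_confusions(pairs):
    """Estimate P(recognized phoneme | true phoneme).

    Assumption for this sketch: each pair holds two phoneme sequences of
    equal length, so only one-to-one substitutions are modeled.
    """
    counts = defaultdict(Counter)
    for truth, recognized in pairs:
        for t, r in zip(truth, recognized):
            counts[t][r] += 1
    return {t: {r: n / sum(c.values()) for r, n in c.items()}
            for t, c in counts.items()}


def simulate(phonemes, model, rng):
    """Sample a corrupted phoneme sequence; unseen phonemes pass through."""
    out = []
    for p in phonemes:
        dist = model.get(p, {p: 1.0})
        out.append(rng.choices(list(dist), weights=list(dist.values()))[0])
    return out
```

In a fuller pipeline, sequences sampled from the confusion model would be mapped back to word hypotheses and scored with `wer` against the text corpus; that decoding step is omitted here because the abstract does not describe it in detail.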

Bibliographic record

Source: European Conference on Speech Communication and Technology - EUROSPEECH 2003 (INTERSPEECH 2003), vol. 2, September 1-4, 2003, Geneva (CH), pp. 929-932 (4 pages)
Conference location: Geneva (CH)
Authors: Yonggang Deng; Milind Mahajan; Alex Acero
Author affiliation: Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD 21218, USA
Original format: PDF
Text language: English
Chinese Library Classification: automatic information theory

