Phonetically-oriented word error alignment for speech recognition error analysis in speech translation

Abstract

We propose a variation to the commonly used Word Error Rate (WER) metric for speech recognition evaluation which incorporates the alignment of phonemes, in the absence of time boundary information. After computing the Levenshtein alignment on words in the reference and hypothesis transcripts, spans of adjacent errors are converted into phonemes with word and syllable boundaries and a phonetic Levenshtein alignment is performed. The phoneme alignment information is used to correct the word alignment labels in each error region. We demonstrate that our Phonetically-Oriented Word Error Rate (POWER) yields similar scores to WER with the added advantages of better word alignments and the ability to capture one-to-many alignments corresponding to homophonic errors in speech recognition hypotheses. These improved alignments allow us to better trace the impact of Levenshtein error types in speech recognition on downstream tasks such as speech translation.
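The word-level stage described above (a Levenshtein alignment over reference and hypothesis words, followed by grouping adjacent errors into spans) can be sketched as follows. This is a minimal illustration of the standard WER alignment only, not the authors' implementation. The function names `align_words`, `wer`, and `error_spans` and the "ice cream" example are assumptions made for this sketch. The phoneme-level realignment that POWER adds is not shown, since it needs a pronunciation lexicon that the abstract does not specify.

```python
from typing import List, Tuple

Op = Tuple[str, str, str]  # (label, reference word, hypothesis word)


def align_words(ref: List[str], hyp: List[str]) -> List[Op]:
    """Levenshtein-align two word lists with unit costs.

    Labels: C = correct, S = substitution, D = deletion, I = insertion.
    """
    n, m = len(ref), len(hyp)
    # cost[i][j] = edit distance between ref[:i] and hyp[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            cost[i][j] = min(sub, cost[i - 1][j] + 1, cost[i][j - 1] + 1)

    # Backtrace from the bottom-right corner to recover one optimal path,
    # preferring match/substitution, then deletion, then insertion.
    ops: List[Op] = []
    i, j = n, m
    while i > 0 or j > 0:
        diag = i > 0 and j > 0 and (
            cost[i][j] == cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
        )
        if diag:
            label = "C" if ref[i - 1] == hyp[j - 1] else "S"
            ops.append((label, ref[i - 1], hyp[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            ops.append(("D", ref[i - 1], ""))
            i -= 1
        else:
            ops.append(("I", "", hyp[j - 1]))
            j -= 1
    ops.reverse()
    return ops


def wer(ref: List[str], hyp: List[str]) -> float:
    """Word error rate: (S + D + I) / number of reference words.

    Assumes a non-empty reference.
    """
    errors = sum(1 for label, _, _ in align_words(ref, hyp) if label != "C")
    return errors / len(ref)


def error_spans(ops: List[Op]) -> List[Tuple[int, int]]:
    """Group adjacent non-correct operations into (start, end) spans, end exclusive."""
    spans: List[Tuple[int, int]] = []
    start = None
    for k, (label, _, _) in enumerate(ops):
        if label != "C":
            if start is None:
                start = k
        elif start is not None:
            spans.append((start, k))
            start = None
    if start is not None:
        spans.append((start, len(ops)))
    return spans


if __name__ == "__main__":
    ref = "i went to the ice cream shop".split()
    hyp = "i went to the i scream shop".split()
    ops = align_words(ref, hyp)
    print(ops)
    print(error_spans(ops), wer(ref, hyp))
```

In this hypothetical example the word-level alignment labels "ice"/"i" and "cream"/"scream" as two independent substitutions inside a single error span. That span is the kind of region the abstract proposes to convert into phonemes and realign.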


Bibliographic details

Source: IEEE Workshop on Automatic Speech Recognition and Understanding | 2015 | pp. 296-302 | 7 pages
Authors: Nicholas Ruiz; Marcello Federico
Original format: PDF
Keywords: automatic speech recognition; error analysis; mixed-effects models; speech translation

Similar literature

1. Spoken Word Recognition Errors in Speech Audiometry: A Measure of Hearing Performance? [J]. Martine Coene, Anneke van der Lee, Paul J. Govaerts. BioMed research international. 2015, Issue 44

2. Spoken Word Recognition Errors in Speech Audiometry: A Measure of Hearing Performance? [J]. Martine Coene, Anneke van der Lee, Paul J. Govaerts. BioMed research international. 2015, Issue 2

3. Integration of speech recognition and machine translation: Speech recognition word lattice translation [J]. Ruiqiang Zhang, Genichiro Kikui. Speech Communication. 2006, Issue 3-4

4. Phonetically-oriented word error alignment for speech recognition error analysis in speech translation [C]. Nicholas Ruiz, Marcello Federico. IEEE Workshop on Automatic Speech Recognition and Understanding. 2015

5. Analysis of Mandarin tonal errors in connected speech by English-speaking American adult learners: A study at and above the word level. [D]. Chen, Qinghai. 2000

6. Spoken Word Recognition Errors in Speech Audiometry: A Measure of Hearing Performance? [O]. Martine Coene, Anneke van der Lee, Paul J. Govaerts

7. Why word error rate is not a good metric for speech recognizer training for the speech translation task [O]. Xiaodong He, Li Deng, Alex Acero. 2013
