Conference: IEEE Workshop on Automatic Speech Recognition and Understanding

Phonetically-oriented word error alignment for speech recognition error analysis in speech translation


Abstract

We propose a variation to the commonly used Word Error Rate (WER) metric for speech recognition evaluation which incorporates the alignment of phonemes, in the absence of time boundary information. After computing the Levenshtein alignment on words in the reference and hypothesis transcripts, spans of adjacent errors are converted into phonemes with word and syllable boundaries and a phonetic Levenshtein alignment is performed. The phoneme alignment information is used to correct the word alignment labels in each error region. We demonstrate that our Phonetically-Oriented Word Error Rate (POWER) yields similar scores to WER with the added advantages of better word alignments and the ability to capture one-to-many alignments corresponding to homophonic errors in speech recognition hypotheses. These improved alignments allow us to better trace the impact of Levenshtein error types in speech recognition on downstream tasks such as speech translation.
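The first stage described in the abstract (a word-level Levenshtein alignment, followed by grouping adjacent errors into spans) can be sketched roughly as below. This is a minimal illustration, not the authors' implementation: the function names `align_words`, `wer`, and `error_spans` are hypothetical, and the second stage (converting each span to phonemes with word and syllable boundaries and realigning) is omitted because it needs a pronunciation lexicon that the abstract does not specify.

```python
from typing import List, Tuple

Op = Tuple[str, str, str]  # (label, reference word, hypothesis word)


def align_words(ref: List[str], hyp: List[str]) -> List[Op]:
    """Word-level Levenshtein alignment with unit costs.

    Labels: C = correct, S = substitution, D = deletion, I = insertion.
    """
    n, m = len(ref), len(hyp)
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            cost[i][j] = min(diag, cost[i - 1][j] + 1, cost[i][j - 1] + 1)

    # Backtrace from the bottom-right corner to recover one optimal alignment.
    ops: List[Op] = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and cost[i][j] == cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1]):
            label = "C" if ref[i - 1] == hyp[j - 1] else "S"
            ops.append((label, ref[i - 1], hyp[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            ops.append(("D", ref[i - 1], ""))
            i -= 1
        else:
            ops.append(("I", "", hyp[j - 1]))
            j -= 1
    ops.reverse()
    return ops


def wer(ops: List[Op]) -> float:
    """Word Error Rate: (S + D + I) divided by the number of reference words."""
    ref_len = sum(1 for label, _, _ in ops if label != "I")
    if ref_len == 0:
        raise ValueError("reference transcript is empty")
    errors = sum(1 for label, _, _ in ops if label != "C")
    return errors / ref_len


def error_spans(ops: List[Op]) -> List[List[Op]]:
    """Group runs of adjacent non-correct operations into error spans."""
    spans: List[List[Op]] = []
    current: List[Op] = []
    for op in ops:
        if op[0] == "C":
            if current:
                spans.append(current)
                current = []
        else:
            current.append(op)
    if current:
        spans.append(current)
    return spans


ref = "we want to recognize speech".split()
hyp = "we want to wreck a nice beach".split()
ops = align_words(ref, hyp)
spans = error_spans(ops)
```

In this example the two reference words "recognize speech" and the four hypothesis words "wreck a nice beach" fall into a single error span. A span like this is where a phoneme-level realignment could recover the one-to-many correspondence that plain word-level labels cannot express.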
