首页> 外文期刊>International journal of speech technology >Within-word pronunciation variation modeling for Arabic ASRs:a direct data-driven approach
【24h】

Within-word pronunciation variation modeling for Arabic ASRs:a direct data-driven approach

机译:阿拉伯语ASR的词内发音变化建模:直接数据驱动方法

获取原文
获取原文并翻译 | 示例
           

摘要

Pronunciation variation is a major obstacle in improving the performance of Arabic automatic continuous speech recognition systems. This phenomenon alters the pronunciation spelling of words beyond their listed forms in the pronunciation dictionary, leading to a number of out of vocabulary word forms. This paper presents a direct data-driven approach to model within-word pronunciation variations, in which the pronunciation variants are distilled from the training speech corpus. The proposed method consists of performing phoneme recognition, followed by a sequence alignment between the observation phonemes generated by the phoneme recognizer and the reference phonemes obtained from the pronunciation dictionary. The unique collected variants are then added to dictionary as well as to the language model. We started with a Baseline Arabic speech recognition system based on Sphinx3 engine. The Baseline system is based on a 5.4 hours speech corpus of modern standard Arabic broadcast news, with a pronunciation dictionary of 14,234 canonical pronunciations. The Baseline system achieves a word error rate of 13.39%. Our results show that while the expanded dictionary alone did not add appreciable improvements, the word error rate is significantly reduced by 2.22% when the variants are represented within the language model.
机译:语音变化是提高阿拉伯自动连续语音识别系统性能的主要障碍。这种现象使单词的发音拼写超出了其在发音词典中列出的形式,从而导致了许多单词单词形式的出现。本文提出了一种直接的数据驱动方法来建模词内发音变化,其中从训练语音语料库中提取出发音变化。所提出的方法包括执行音素识别,然后是音素识别器生成的观察音素与从发音词典中获得的参考音素之间的序列比对。然后将收集到的独特变体添加到字典以及语言模型中。我们从基于Sphinx3引擎的基线阿拉伯语语音识别系统开始。 Baseline系统基于现代标准阿拉伯广播新闻的5.4小时语音语料库,带有14,234种规范发音的发音词典。基准系统实现了13.39%的字错误率。我们的结果表明,尽管仅扩展字典并没有带来明显的改进,但是当在语言模型中表示变体时,单词错误率显着降低了2.22%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号