首页> 外文会议>Annual Conference of the International Speech Communication Association >Speed perturbation and vowel duration modeling for ASR in Hausa and Wolof languages
【24h】

Speed perturbation and vowel duration modeling for ASR in Hausa and Wolof languages

机译:Hausa和Wolof语言中ASR的速度扰动和元音持续时间建模

获取原文

摘要

Automatic Speech Recognition (ASR) for (under-resourced) Sub-Saharan African languages faces several challenges: small amount of transcribed speech, written language normalization issues, few text resources available for language modeling, as well as specific features (tones, morphology, etc.) that need to be taken into account seriously to optimize ASR performance. This paper tries to address some of the above challenges through the development of ASR systems for two Sub-Saharan African languages: Hausa and Wolof. First, we investigate data augmentation technique (through speed perturbation) to overcome the lack of resources. Secondly, the main contribution is our attempt to model vowel length contrast existing in both languages. For reproducible experiments, the ASR systems developed for Hausa and Wolof are made available to the research community on github. To our knowledge, the Wolof ASR system presented in this paper is the first large vocabulary continuous speech recognition system ever developed for this language.
机译:自动语音识别(ASR)(资源欠资源)撒哈拉以南非洲语言面临几种挑战:少量转录的语言,书面语言正常化问题,很少可用于语言建模的文本资源,以及特定的功能(音调,形态,等等)严重需要考虑到优化ASR性能。本文试图通过为两个撒哈拉以南非洲语言的ASR系统的开发来解决一些上述挑战:Hausa和Wolof。首先,我们调查数据增强技术(通过速度扰动)来克服缺乏资源。其次,主要贡献是我们试图模拟两种语言存在的元音长度对比度。对于可重复的实验,为GitHub上的研究界提供了为Hausa和Wolof开发的ASR系统。据我们所知,本文介绍的Wolof ASR系统是第一个为此语言开发的大型词汇连续语音识别系统。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号