Speed perturbation and vowel duration modeling for ASR in Hausa and Wolof languages

Abstract

Automatic Speech Recognition (ASR) for under-resourced Sub-Saharan African languages faces several challenges: a small amount of transcribed speech, written-language normalization issues, few text resources for language modeling, and language-specific features (tones, morphology, etc.) that must be taken into account to optimize ASR performance. This paper addresses some of these challenges through the development of ASR systems for two Sub-Saharan African languages: Hausa and Wolof. First, we investigate a data augmentation technique (speed perturbation) to overcome the lack of resources. Second, as the main contribution, we attempt to model the vowel length contrast that exists in both languages. For reproducible experiments, the ASR systems developed for Hausa and Wolof are made available to the research community on GitHub. To our knowledge, the Wolof ASR system presented in this paper is the first large-vocabulary continuous speech recognition system developed for this language.
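
The abstract names speed perturbation as the augmentation technique but gives no factors or tooling. The following is a minimal sketch of the general idea only, not the authors' pipeline: it resamples a mono waveform by linear interpolation so that it plays faster or slower. The function names `speed_perturb` and `augment` and the factors 0.9, 1.0 and 1.1 are assumptions chosen for illustration.

```python
import numpy as np


def speed_perturb(signal: np.ndarray, factor: float) -> np.ndarray:
    """Return a copy of a mono waveform that plays `factor` times faster.

    factor > 1 shortens the signal, factor < 1 lengthens it. Linear
    interpolation is used here for simplicity (an assumption of this sketch).
    """
    if factor <= 0:
        raise ValueError("factor must be positive")
    n_out = int(round(len(signal) / factor))
    # Positions in the original signal that each output sample reads from.
    src_positions = np.arange(n_out) * factor
    # np.interp clamps positions past the last sample to the last value.
    return np.interp(src_positions, np.arange(len(signal)), signal)


def augment(signal: np.ndarray, factors=(0.9, 1.0, 1.1)) -> dict:
    """Build one perturbed copy of the signal per speed factor (assumed values)."""
    return {f: speed_perturb(signal, f) for f in factors}


if __name__ == "__main__":
    sample_rate = 16000  # assumed sampling rate in Hz
    t = np.arange(sample_rate) / sample_rate
    tone = np.sin(2 * np.pi * 220.0 * t)  # one second of a 220 Hz tone
    for factor, copy in augment(tone).items():
        print(factor, len(copy) / sample_rate, "seconds")
```

Because the resampled audio is played back at the original sampling rate, both duration and pitch change. Each perturbed copy keeps the transcript of its source utterance, which is how the technique enlarges a small transcribed corpus.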

Bibliographic record

Source
Annual Conference of the International Speech Communication Association | 2016 | pp. 3106-3887 | 5 pages
Authors
Elodie Gauthier; Laurent Besacier; Sylvie Voisin
Original format: PDF
Chinese Library Classification: TB95-53

