首页> 外文期刊>Procedia Computer Science >Automatic Speech Recognition of English-isiZulu Code-switched Speech from South African Soap Operas
【24h】

Automatic Speech Recognition of English-isiZulu Code-switched Speech from South African Soap Operas

机译:南非肥皂剧中英语-西祖鲁语代码转换语音的自动语音识别

获取原文
       

摘要

We introduce a new English-isiZulu code-switched speech corpus compiled from South African soap opera broadcasts. isiZulu itself is currently under-resourced, and automatic speech recognition is made even more challenging by the high prevalence of code-switching in spontaneous speech. Analysis of the corpus reflects effects common in conversational isiZulu, such as vowel deletion and cross-language prefixes and suffixes. Baseline monolingual and code-switched automatic speech recognition systems are developed, including a new language model configuration that explicitly includes switching transitions. For code-switched speech, a system with language-dependent acoustic models and language-dependent language models linked by switching transitions leads to best performance, although word error rates overall remain very high.
机译:我们介绍了一种由南非肥皂剧播出的新的英语-isiZulu代码转换语音语料库。 isiZulu本身目前资源不足,并且自发语音中代码转换的普遍性使自动语音识别更具挑战性。语料库的分析反映了会话岛上常见的影响,例如元音删除和跨语言前缀和后缀。开发了基准单语言和代码转换的自动语音识别系统,包括新的语言模型配置,该配置明确包含转换转换。对于代码转换语音,尽管总体上词错误率仍然很高,但具有通过转换转换链接的与语言相关的声学模型和与语言相关的语言模型的系统可实现最佳性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号