首页> 外文会议>Asia-Pacific Signal and Information Processing Association Annual Summit and Conference >Acoustic modeling with a shared phoneme set for multilingual speech recognition without code-switching
【24h】

Acoustic modeling with a shared phoneme set for multilingual speech recognition without code-switching

机译:带有共享音素集的声学建模,无需进行代码切换即可进行多语言语音识别

获取原文

摘要

This paper proposes a new acoustic modeling method for the automatic speech recognition (ASR) of data, in which multilingual utterances are mixed, without using any language identification technologies. To perform ASR of unknown-language utterance, first, language identification is performed to determine the language. Then, a language-specific ASR system is used to recognize the utterance. Our proposed method does not train language-specific acoustic models but trains an acoustic model that can speech-recognize utterances spoken by some sort of language. To realize multilingual acoustic modeling, we create a new phoneme set by sharing a part of language-specific phonemes with other languages. The shared phoneme set enables the amount of training data to increase on appearance. Therefore, the acoustic model with the shared phoneme set can perform ASR for a minor language (low-resource language) utterance. The experimental result showed that the acoustic model with the shared phoneme set improved ASR performance for a few languages in comparison with the language-specific ASR system in which language identification was perfectly performed.
机译:本文提出了一种新的用于数据自动语音识别(ASR)的声学建模方法,该方法无需使用任何语言识别技术即可将多种语言的语音混合在一起。为了执行未知语言话语的ASR,首先,执行语言识别以确定语言。然后,使用特定于语言的ASR系统来识别话语。我们提出的方法不是训练特定于语言的声学模型,而是训练可以语音识别某种语言说出的话语的声学模型。为了实现多语言声学建模,我们通过与其他语言共享特定语言音素的一部分来创建新的音素集。共享音素集使训练数据的数量在外观上得以增加。因此,具有共享音素集的声学模型可以针对次要语言(低资源语言)说话执行ASR。实验结果表明,与特定语言的ASR系统相比,共享音素的声学模型对几种语言的ASR性能有所提高,在特定语言中,ASR系统可以完美地进行语言识别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号