首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Automatic pronunciation prediction for text-to-speech synthesis of dialectal arabic in a speech-to-speech translation system
【24h】

Automatic pronunciation prediction for text-to-speech synthesis of dialectal arabic in a speech-to-speech translation system

机译:在语音到语音翻译系统中用于方言阿拉伯语文本到语音合成的自动语音预测

获取原文

摘要

Text-to-speech synthesis (TTS) is the final stage in the speech-tospeech (S2S) translation pipeline, producing an audible rendition of translated text in the target language. TTS systems typically rely on a lexicon to look up pronunciations for each word in the input text. This is problematic when the target language is dialectal Arabic, because the statistical machine translation (SMT) system usually produces undiacritized text output. Many words in the latter possess multiple pronunciations; the correct choice must be inferred from context. In this paper, we present a weakly supervised pronunciation prediction approach for undiacritized dialectal Arabic in S2S systems that leverages automatic speech recognition (ASR) to obtain parallel training data for pronunciation prediction. Additionally, we show that incorporating source language features derived from SMT-generated automatic word alignment further improves automatic pronunciation prediction accuracy.
机译:语音合成(TTS)是语音(S2S)翻译流程中的最后阶段,可产生目标语言翻译后的声音。 TTS系统通常依赖于词典来查找输入文本中每个单词的发音。当目标语言是方言阿拉伯语时,这是有问题的,因为统计机器翻译(SMT)系统通常会产生不带文字的文本输出。后者中的许多单词具有多种发音;必须根据上下文推断出正确的选择。在本文中,我们提出了一种在S2S系统中对不绝经的方言阿拉伯语进行弱监督的语音预测方法,该方法利用自动语音识别(ASR)获得用于语音预测的并行训练数据。此外,我们表明,结合从SMT生成的自动单词对齐中获得的源语言功能,可以进一步提高自动发音预测的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号