首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Automatic pronunciation prediction for text-to-speech synthesis of dialectal arabic in a speech-to-speech translation system
【24h】

Automatic pronunciation prediction for text-to-speech synthesis of dialectal arabic in a speech-to-speech translation system

机译:语言翻译系统中言语阿拉伯语文本与语音合成的自动发音预测

获取原文

摘要

Text-to-speech synthesis (TTS) is the final stage in the speech-tospeech (S2S) translation pipeline, producing an audible rendition of translated text in the target language. TTS systems typically rely on a lexicon to look up pronunciations for each word in the input text. This is problematic when the target language is dialectal Arabic, because the statistical machine translation (SMT) system usually produces undiacritized text output. Many words in the latter possess multiple pronunciations; the correct choice must be inferred from context. In this paper, we present a weakly supervised pronunciation prediction approach for undiacritized dialectal Arabic in S2S systems that leverages automatic speech recognition (ASR) to obtain parallel training data for pronunciation prediction. Additionally, we show that incorporating source language features derived from SMT-generated automatic word alignment further improves automatic pronunciation prediction accuracy.
机译:文本到语音合成(TTS)是语音 - tospeech(S2S)翻译流水线中的最终阶段,在目标语言中产生可听文本的可听迭代。 TTS系统通常依赖于Lexicon以查找输入文本中的每个单词的发音。当目标语言是辩证阿拉伯语时,这是有问题的,因为统计机器翻译(SMT)系统通常会产生未经编译的文本输出。后者的许多单词都有多个发音;必须从上下文中推断出正确的选择。在本文中,我们在S2S系统中为未经译码的语言阿拉伯语发出了弱监督的发音预测方法,其利用自动语音识别(ASR)来获得语音预测的并行训练数据。此外,我们表明,源语言源语言功能源自SMT生成的自动字对齐,进一步提高了自动发音预测精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号