Automatic pronunciation prediction for text-to-speech synthesis of dialectal arabic in a speech-to-speech translation system

机译：在语音到语音翻译系统中用于方言阿拉伯语文本到语音合成的自动语音预测

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Text-to-speech synthesis (TTS) is the final stage in the speech-tospeech (S2S) translation pipeline, producing an audible rendition of translated text in the target language. TTS systems typically rely on a lexicon to look up pronunciations for each word in the input text. This is problematic when the target language is dialectal Arabic, because the statistical machine translation (SMT) system usually produces undiacritized text output. Many words in the latter possess multiple pronunciations; the correct choice must be inferred from context. In this paper, we present a weakly supervised pronunciation prediction approach for undiacritized dialectal Arabic in S2S systems that leverages automatic speech recognition (ASR) to obtain parallel training data for pronunciation prediction. Additionally, we show that incorporating source language features derived from SMT-generated automatic word alignment further improves automatic pronunciation prediction accuracy.

机译：语音合成（TTS）是语音（S2S）翻译流程中的最后阶段，可产生目标语言翻译后的声音。 TTS系统通常依赖于词典来查找输入文本中每个单词的发音。当目标语言是方言阿拉伯语时，这是有问题的，因为统计机器翻译（SMT）系统通常会产生不带文字的文本输出。后者中的许多单词具有多种发音；必须根据上下文推断出正确的选择。在本文中，我们提出了一种在S2S系统中对不绝经的方言阿拉伯语进行弱监督的语音预测方法，该方法利用自动语音识别（ASR）获得用于语音预测的并行训练数据。此外，我们表明，结合从SMT生成的自动单词对齐中获得的源语言功能，可以进一步提高自动发音预测的准确性。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP》|2012年|p.4957- 4960|共4页
会议地点 Kyoto(JP)
作者
Ananthakrishnan, Sankaranarayanan;
展开▼
作者单位

Speech Language and Multimedia Unit Raytheon BBN Technologies Cambridge MA 02138 U.S.A.;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Development of an automatic phonetization system for Arabic text-to-speech synthesis [J] . Faycal Imedjdouben, Amrane Houacine International journal of speech technology . 2014,第4期

机译：开发用于阿拉伯文本语音合成的自动语音系统
2. Cross-Dialect Adaptation Framework for Constructing Prosodic Models for Chinese Dialect Text-to-Speech Systems [J] . Chen-Yu Chiang Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2018,第1期

机译：跨方言适应框架构建汉语方言文本语音系统的韵律模型
3. Development of the Arabic Loria Automatic Speech Recognition system (ALASR) and its evaluation for Algerian dialect [J] . Mohamed Amine Menacer, Odile Mella, Dominique Fohr, Procedia Computer Science . 2017,第1期

机译：阿拉伯语Loria自动语音识别系统（ALASR）的开发及其对阿尔及利亚方言的评估
4. Automatic pronunciation prediction for text-to-speech synthesis of dialectal arabic in a speech-to-speech translation system [C] . Ananthakrishnan S., Tsakalidis S., Prasad R., IEEE International Conference on Acoustics, Speech and Signal Processing . 2011

机译：语言翻译系统中言语阿拉伯语文本与语音合成的自动发音预测
5. An Investigation into Approaches to Text-to-Speech Synthesis for Modern Standard Arabic [D] . ?Alabbad, Dena A. 2019

机译：现代标准阿拉伯文文本综合综合方法的调查
6. A Neural Machine Translation Model for Arabic Dialects That Utilises Multitask Learning (MTL) [O] . Laith H. Baniata, Seyoung Park, Seong-Bae Park 2018

机译：利用多任务学习（MTL）的阿拉伯语神经机器翻译模型
7. An Analysis of Machine Translation and Speech Synthesis in Speech-To-Speech Translation System [O] . Hashimoto, Kei, Yamagishi, Junichi, Byrne, William, 2011

机译：语音转换系统中的机器翻译和语音合成分析

Automatic pronunciation prediction for text-to-speech synthesis of dialectal arabic in a speech-to-speech translation system

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅