Automatic pronunciation prediction for text-to-speech synthesis of dialectal arabic in a speech-to-speech translation system

机译：语言翻译系统中言语阿拉伯语文本与语音合成的自动发音预测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Text-to-speech synthesis (TTS) is the final stage in the speech-tospeech (S2S) translation pipeline, producing an audible rendition of translated text in the target language. TTS systems typically rely on a lexicon to look up pronunciations for each word in the input text. This is problematic when the target language is dialectal Arabic, because the statistical machine translation (SMT) system usually produces undiacritized text output. Many words in the latter possess multiple pronunciations; the correct choice must be inferred from context. In this paper, we present a weakly supervised pronunciation prediction approach for undiacritized dialectal Arabic in S2S systems that leverages automatic speech recognition (ASR) to obtain parallel training data for pronunciation prediction. Additionally, we show that incorporating source language features derived from SMT-generated automatic word alignment further improves automatic pronunciation prediction accuracy.

机译：文本到语音合成（TTS）是语音 - tospeech（S2S）翻译流水线中的最终阶段，在目标语言中产生可听文本的可听迭代。 TTS系统通常依赖于Lexicon以查找输入文本中的每个单词的发音。当目标语言是辩证阿拉伯语时，这是有问题的，因为统计机器翻译（SMT）系统通常会产生未经编译的文本输出。后者的许多单词都有多个发音;必须从上下文中推断出正确的选择。在本文中，我们在S2S系统中为未经译码的语言阿拉伯语发出了弱监督的发音预测方法，其利用自动语音识别（ASR）来获得语音预测的并行训练数据。此外，我们表明，源语言源语言功能源自SMT生成的自动字对齐，进一步提高了自动发音预测精度。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2011年||共4页
会议地点
作者
Ananthakrishnan S.; Tsakalidis S.; Prasad R.; Natarajan P.; Vembu A.N.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词
入库时间 2022-08-20 19:54:59

相似文献

外文文献
中文文献
专利

1. Development of an automatic phonetization system for Arabic text-to-speech synthesis [J] . Faycal Imedjdouben, Amrane Houacine International journal of speech technology . 2014,第4期

机译：开发用于阿拉伯文本语音合成的自动语音系统
2. Cross-Dialect Adaptation Framework for Constructing Prosodic Models for Chinese Dialect Text-to-Speech Systems [J] . Chen-Yu Chiang Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2018,第1期

机译：跨方言适应框架构建汉语方言文本语音系统的韵律模型
3. Development of the Arabic Loria Automatic Speech Recognition system (ALASR) and its evaluation for Algerian dialect [J] . Mohamed Amine Menacer, Odile Mella, Dominique Fohr, Procedia Computer Science . 2017,第1期

机译：阿拉伯语Loria自动语音识别系统（ALASR）的开发及其对阿尔及利亚方言的评估
4. Automatic pronunciation prediction for text-to-speech synthesis of dialectal arabic in a speech-to-speech translation system [C] . Ananthakrishnan, Sankaranarayanan IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP . 2012

机译：在语音到语音翻译系统中用于方言阿拉伯语文本到语音合成的自动语音预测
5. An Investigation into Approaches to Text-to-Speech Synthesis for Modern Standard Arabic [D] . ?Alabbad, Dena A. 2019

机译：现代标准阿拉伯文文本综合综合方法的调查
6. A Neural Machine Translation Model for Arabic Dialects That Utilises Multitask Learning (MTL) [O] . Laith H. Baniata, Seyoung Park, Seong-Bae Park 2018

机译：利用多任务学习（MTL）的阿拉伯语神经机器翻译模型
7. An Analysis of Machine Translation and Speech Synthesis in Speech-To-Speech Translation System [O] . Hashimoto, Kei, Yamagishi, Junichi, Byrne, William, 2011

机译：语音转换系统中的机器翻译和语音合成分析

Automatic pronunciation prediction for text-to-speech synthesis of dialectal arabic in a speech-to-speech translation system

摘要

著录项

相似文献

相关主题

期刊订阅