首页> 外文期刊>Signal, Image and Video Processing >Towards a high quality Arabic speech synthesis system based on neural networks and residual excited vocal tract model - Springer
【24h】

Towards a high quality Arabic speech synthesis system based on neural networks and residual excited vocal tract model - Springer

机译:建立基于神经网络和残余激励声道模型的高质量阿拉伯语语音合成系统-Springer

获取原文
获取原文并翻译 | 示例
       

摘要

Text-to-speech conversion has traditionally been performed either by concatenating short samples of speech or by using rule-based systems to convert a phonetic representation of speech into an acoustic representation, which is then converted into speech. This paper describes a text-to-speech synthesis system for modern standard Arabic based on artificial neural networks and residual excited LPC coder. The networks offer a storage-efficient means of synthesis without the need for explicit rule enumeration. These neural networks require large prosodically labeled continuous speech databases in their training stage. As such databases are not available for the Arabic language, we have developed one for this purpose. Thus, we discuss various stages undertaken for this development process. In addition to interpolation capabilities of neural networks, a linear interpolation of the coder parameters is performed to create smooth transitions at segment boundaries. A residual-excited all pole vocal tract model and a prosodic-information synthesizer based on neural networks are also described in this paper.
机译:传统上,通过连接简短的语音样本或使用基于规则的系统将语音的语音表示转换为声音表示,然后将其转换为语音,即可执行文本到语音的转换。本文介绍了一种基于人工神经网络和残余激励LPC编码器的现代标准阿拉伯语文本到语音合成系统。网络提供了一种存储有效的综合手段,而无需显式的规则枚举。这些神经网络在训练阶段需要大量带有标签的连续语音数据库。由于无法使用阿拉伯语数据库,因此我们为此目的开发了一个数据库。因此,我们讨论了该开发过程的各个阶段。除了神经网络的内插功能外,还执行编码器参数的线性内插以在段边界处创建平滑过渡。本文还描述了残差激励的全极声道模型和基于神经网络的韵律信息合成器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号