首页> 外文会议>2013 World Congress on Computer and Information Technology >Arabic text to speech synthesis based on neural networks for MFCC estimation
【24h】

Arabic text to speech synthesis based on neural networks for MFCC estimation

机译:基于神经网络的MFCC阿拉伯文本语音合成

获取原文
获取原文并翻译 | 示例

摘要

With the increasing number of users of text to speech applications, high quality speech synthesis is required. However, only few researches concern Arabic text to speech applications. Compared with other languages such as English and French the quality of Arabic synthesis speech is still poor. For these reasons, we propose in this paper an Arabic text to speech synthesis system based on statistical parametric synthesis. Mel Frequency Cepstral Coefficients (MFCC), energy and pitch are predicted using back propagation artificial neural networks and then transformed into speech using Mel Log Spectrum Approximation filter. Often, in Arabic written text, the short vowels called diacritic marks are omitted. So, a diacritization system is proposed to resolve this problem. Different unit sizes are considered in speech database which are phoneme, diphone and triphone. MFCC neural network architecture and an objective evaluation with the MFCC distortion measure are given in this paper.
机译:随着文本到语音应用程序用户的增加,需要高质量的语音合成。然而,只有很少的研究涉及阿拉伯文本到语音的应用。与英语和法语等其他语言相比,阿拉伯文综合语音的质量仍然很差。由于这些原因,我们在本文中提出了一种基于统计参数综合的阿拉伯文语音合成系统。使用反向传播人工神经网络预测梅尔频率倒谱系数(MFCC),能量和音调,然后使用梅尔对数频谱逼近滤波器将其转换为语音。在阿拉伯文字中,通常会省略称为变音符号的短元音。因此,提出了一种非锐化系统来解决该问题。语音数据库中考虑了不同的单位大小,它们是音素,双音和三音。本文给出了MFCC神经网络架构以及使用MFCC失真度量的客观评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号