首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Investigation of Enhanced Tacotron Text-to-speech Synthesis Systems with Self-attention for Pitch Accent Language
【24h】

Investigation of Enhanced Tacotron Text-to-speech Synthesis Systems with Self-attention for Pitch Accent Language

机译:音调重音自关注增强型Tacotron语音合成系统的研究

获取原文

摘要

End-to-end speech synthesis is a promising approach that directly converts raw text to speech. Although it was shown that Tacotron2 outperforms classical pipeline systems with regards to naturalness in English, its applicability to other languages is still unknown. Japanese could be one of the most difficult languages for which to achieve end-to-end speech synthesis, largely due to its character diversity and pitch accents. Therefore, state-of-the-art systems are still based on a traditional pipeline framework that requires a separate text analyzer and duration model. Towards end-to-end Japanese speech synthesis, we extend Tacotron to systems with self-attention to capture long-term dependencies related to pitch accents and compare their audio quality with classical pipeline systems under various conditions to show their pros and cons. In a large-scale listening test, we investigated the impacts of the presence of accentual-type labels, the use of force or predicted alignments, and acoustic features used as local condition parameters of the Wavenet vocoder. Our results reveal that although the proposed systems still do not match the quality of a top-line pipeline system for Japanese, we show important stepping stones towards end-to-end Japanese speech synthesis.
机译:端到端语音合成是一种有前途的方法,可以直接将原始文本转换为语音。尽管已证明Tacotron2在英语自然性方面要优于经典管道系统,但其在其他语言中的适用性仍然未知。日语可能是实现端到端语音合成的最困难的语言之一,这在很大程度上是由于日语的字符多样性和音调重音。因此,最新的系统仍然基于传统的流水线框架,该框架需要单独的文本分析器和持续时间模型。为了实现端到端的日语语音合成,我们将Tacotron扩展到具有自我注意力的系统,以捕获与音调重音有关的长期依赖性,并将其音频质量与各种条件下的经典管道系统进行比较,以显示其优缺点。在大规模的听力测试中,我们调查了重音类型标签的存在,使用力或预测的对齐方式以及用作Wavenet声码器局部条件参数的声学特征的影响。我们的结果表明,尽管所提出的系统仍与日语的顶级流水线系统的质量不匹配,但我们显示了走向端到端日语语音合成的重要垫脚石。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号