首页> 外文会议>International Conference on Text, Speech and Dialogue >Minimum Text Corpus Selection for Limited Domain Speech Synthesis
【24h】

Minimum Text Corpus Selection for Limited Domain Speech Synthesis

机译:限量域语音合成的最低文本语料库选择

获取原文

摘要

This paper concerns limited domain TTS system based on the con-catenative method, and presents an algorithm capable to extract the minimal domain-oriented text corpus from the real data of the given domain, while still reaching the maximum coverage of the domain. The proposed approach ensures that the least amount of texts are extracted, containing the most common phrases and (possibly) all the words from the domain. At the same time, it ensures that appropriate phrase overlapping is kept, allowing to find smooth concatenation in the overlapped regions to reach high quality synthesized speech. In addition, several recommendations allowing a speaker to record the corpus more fluently and comfortably are presented and discussed. The corpus building is tested and evaluated on several domains differing in size and nature, and the authors present the results of the algorithm and demonstrate the advantages of using the domain oriented corpus for speech synthesis.
机译:本文涉及基于配置方法的有限域TTS系统,并提出了一种能够从给定域的实际数据提取最小域的文本语料库的算法,同时仍然达到域的最大覆盖范围。所提出的方法可确保提取最少的文本,其中包含最常见的短语和(可能)来自域中的所有单词。同时,它确保保持适当的短语重叠,允许在重叠区域中找到平滑的连接以达到高质量的合成语音。此外,允许发言者更流利和舒适地讨论扬声器的若干建议,并讨论并讨论。在大小和性质不同的域测试和评估语料库建筑物,作者呈现了算法的结果,并展示了使用域导向语料库的语音合成的优点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号