Conference: International conference on text, speech and dialogue

Minimum Text Corpus Selection for Limited Domain Speech Synthesis

Abstract

This paper concerns a limited-domain TTS system based on the concatenative method and presents an algorithm capable of extracting a minimal domain-oriented text corpus from real data of the given domain while still reaching maximum coverage of the domain. The proposed approach ensures that the smallest possible amount of text is extracted, containing the most common phrases and (possibly) all the words from the domain. At the same time, it ensures that appropriate phrase overlap is kept, so that smooth concatenation points can be found in the overlapped regions and high-quality synthesized speech can be achieved. In addition, several recommendations that allow a speaker to record the corpus more fluently and comfortably are presented and discussed. The corpus building is tested and evaluated on several domains differing in size and nature; the authors present the results of the algorithm and demonstrate the advantages of using a domain-oriented corpus for speech synthesis.
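The abstract does not spell out the selection procedure, so the following is only a minimal sketch of one common way to approach this kind of problem: a standard greedy set-cover heuristic that keeps picking the sentence adding the most not-yet-covered words. It is not the authors' algorithm. The names `word_units` and `greedy_select` are hypothetical, the unit of coverage is assumed to be a whitespace-separated lowercase word, and the phrase-overlap requirement mentioned above is not modeled.

```python
from typing import Dict, List, Set


def word_units(sentence: str) -> Set[str]:
    """Hypothetical unit definition: lowercase whitespace-separated words."""
    return set(sentence.lower().split())


def greedy_select(sentences: List[str]) -> List[str]:
    """Select sentences until every word seen in the input is covered.

    Assumption: a plain greedy set-cover heuristic, not the paper's method.
    At each step the sentence covering the most uncovered words is taken;
    ties go to the sentence that appears first in the input.
    """
    units: Dict[str, Set[str]] = {s: word_units(s) for s in sentences}
    uncovered: Set[str] = set().union(*units.values())
    selected: List[str] = []
    while uncovered:
        # Gain of a sentence = number of still-uncovered words it contains.
        best = max(sentences, key=lambda s: len(units[s] & uncovered))
        selected.append(best)
        uncovered -= units[best]
    return selected


if __name__ == "__main__":
    # Made-up illustrative domain data, not taken from the paper.
    domain = [
        "the train to Prague departs at five",
        "the train to Brno departs at six",
        "the train to Prague departs at six",
        "the bus to Brno arrives at five",
    ]
    for sentence in greedy_select(domain):
        print(sentence)
```

Greedy selection does not guarantee the true minimum, but it is a simple baseline for a coverage-driven corpus. A closer match to the description above would score phrases (word n-grams) by frequency and add a term rewarding overlap between selected sentences.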
