首页> 外文会议>2014 12th International Conference on Signal Processing >Reducing footprint of unit selection TTS system by removing linguistic segments with rarely selected units
【24h】

Reducing footprint of unit selection TTS system by removing linguistic segments with rarely selected units

机译:通过删除很少选择的单元的语言段来减少单元选择TTS系统的占用空间

获取原文

摘要

This paper is focused on reducing the size of speech corpora that are used in the unit-selection-based TTS systems. The size of a speech corpus influences the system requirements like storage and memory demands and computational complexity. For high quality speech synthesis, the speech corpus usually consists of several thousands of sentences. Thus an appropriate reduction of the corpus size is likely to lead to a decrease in the system requirements. In this work, a comparison of impacts on synthetic speech quality is presented when removing specific instances of different linguistic segment types from the original corpus. Removal of the following segment types is used and compared with each other: whole sentences, phrases, words, and diphones. Only segments with rarely selected units are removed from the corpus so that the resulting footprint size reaches a predefined value. Results confirm that synthetic speech generated by the TTS systems using the reduced corpora is of a slightly worse quality when compared with speech produced by the system employing the original full corpus. The comparison of the reduction based on different linguistic segments is also presented here.
机译:本文的重点是减少基于单元选择的TTS系统中使用的语音语料库的大小。语音语料库的大小会影响系统要求,例如存储和内存要求以及计算复杂性。对于高质量的语音合成,语音语料库通常由数千个句子组成。因此,适当减小语料库大小可能会导致系统要求降低。在这项工作中,当从原始语料库中删除不同语言段类型的特定实例时,比较了对合成语音质量的影响。使用以下句段类型并将其相互比较:整个句子,短语,单词和双音节。仅从语料库中删除单位很少选择的片段,以使生成的覆盖区大小达到预定义的值。结果证实,与使用原始完整语料库的系统所产生的语音相比,使用减少语料库的TTS系统所生成的合成语音质量稍差。这里还介绍了基于不同语言段的归约的比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号