...
首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Personalized Spontaneous Speech Synthesis Using a Small-Sized Unsegmented Semispontaneous Speech
【24h】

Personalized Spontaneous Speech Synthesis Using a Small-Sized Unsegmented Semispontaneous Speech

机译:使用小型半段半自发语音的个性化自发语音合成

获取原文
获取原文并翻译 | 示例
           

摘要

A systematic approach is proposed to synthesizing personalized spontaneous speech using a small-sized unsegmented speech corpus of the target speaker. First, an automatic segmentation algorithm is employed to segment and label a collected semispontaneous speech corpus of the target speaker. Then, a pretrained average voice model is adapted to the voice model of the target speaker by using the segmented data. A postfilter based on modulation spectrum is adopted to further improve the speaker similarity of the synthesized speech as well as alleviate the over-smoothing problem of the synthesized speech. For generating spontaneous speech, a smoothing method applied at the prosodic word level is proposed to improve speech fluency. For objective evaluation on spontaneous speech segmentation, the segmentation accuracy of the proposed method is superior to that of Viterbi-based forced alignment. The results of subjective listening test also show that the proposed method can improve the spontaneity and speaker similarity of the synthesized speech compared to the maximum likelihood linear regression based speaker adaptation method.
机译:提出了一种系统的方法来使用目标说话者的小型无节段语音语料合成个性化的自发语音。首先,采用自动分割算法对目标说话人收集的半自发语音语料进行分割和标记。然后,通过使用分割的数据,将预训练的平均语音模型适配到目标说话者的语音模型。采用基于调制频谱的后置滤波器,可以进一步提高合成语音的说话人相似度,减轻合成语音的过平滑问题。为了产生自发语音,提出了一种在韵律词水平上应用的平滑方法以提高语音流利度。对于自发语音分割的客观评估,该方法的分割精度优于基于维特比的强制对齐。主观听觉测试的结果还表明,与基于最大似然线性回归的说话人自适应方法相比,该方法可以提高合成语音的自发性和说话人相似性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号