Personalized Spontaneous Speech Synthesis Using a Small-Sized Unsegmented Semispontaneous Speech

Yi-Chin Huang; Chung-Hsien Wu; Yan-You Chen; Ming-Ge Shie; Jhing-Fa Wang

Abstract

This paper proposes a systematic approach to synthesizing personalized spontaneous speech from a small, unsegmented speech corpus of the target speaker. First, an automatic segmentation algorithm segments and labels a semispontaneous speech corpus collected from the target speaker. Then, a pretrained average voice model is adapted to the target speaker using the segmented data. A postfilter based on the modulation spectrum is applied to further improve the speaker similarity of the synthesized speech and to alleviate its over-smoothing problem. To generate spontaneous speech, a smoothing method applied at the prosodic word level is proposed to improve speech fluency. In an objective evaluation of spontaneous speech segmentation, the proposed method achieves higher segmentation accuracy than Viterbi-based forced alignment. A subjective listening test also shows that the proposed method improves the spontaneity and speaker similarity of the synthesized speech compared with a speaker adaptation method based on maximum likelihood linear regression.
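
The abstract does not say how segmentation accuracy is scored. A common convention in the segmentation literature is to count a predicted boundary as correct when it falls within a fixed tolerance (often 20 ms) of the manually labeled boundary. The sketch below illustrates that convention only; the function name `boundary_accuracy`, the 20 ms default, and the boundary times are assumptions made for illustration and are not taken from the paper.

```python
from typing import Sequence


def boundary_accuracy(
    reference_ms: Sequence[int],
    predicted_ms: Sequence[int],
    tolerance_ms: int = 20,  # assumed tolerance, not a value from the paper
) -> float:
    """Fraction of predicted boundaries within tolerance_ms of the reference.

    Assumes both sequences hold one boundary time (in milliseconds) per unit,
    in the same order, as when both segmentations follow the same transcript.
    """
    if len(reference_ms) != len(predicted_ms):
        raise ValueError("boundary lists must have the same length")
    if not reference_ms:
        raise ValueError("no boundaries to compare")
    hits = sum(
        1
        for ref, pred in zip(reference_ms, predicted_ms)
        if abs(ref - pred) <= tolerance_ms
    )
    return hits / len(reference_ms)


# Hypothetical boundary times in milliseconds, made up for illustration.
manual = [100, 320, 580, 910]
system_a = [110, 370, 590, 970]
system_b = [100, 330, 595, 920]

print(boundary_accuracy(manual, system_a))  # 2 of 4 within 20 ms
print(boundary_accuracy(manual, system_b))  # 4 of 4 within 20 ms
```

A tighter or looser tolerance changes the score, so any comparison between segmentation methods should report the tolerance alongside the accuracy.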

Bibliographic record

Source
Audio, Speech, and Language Processing, IEEE/ACM Transactions on | 2017, no. 5 | pp. 1048-1060 | 13 pages
Authors
Yi-Chin Huang; Chung-Hsien Wu; Yan-You Chen; Ming-Ge Shie; Jhing-Fa Wang
Author affiliations

Department of Information Engineering and Computer Science, Feng Chia University, Taichung, Taiwan;

Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan;

Department of Electrical Engineering, National Cheng Kung University, Tainan, Taiwan;

Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan;

Department of Electrical Engineering, National Cheng Kung University, Tainan, Taiwan;

Original format: PDF
Language: English
Keywords
Speech; Hidden Markov models; Adaptation models; Speech synthesis; Data models; Smoothing methods;