Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis

Yamagishi J.; Nose T.; Zen H.; Ling Z.-H.; Toda T.; Tokuda K.; King S.; Renals S.

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis

【24h】

Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis

机译：强大的基于说话人自适应HMM的文本到语音合成

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper describes a speaker-adaptive HMM-based speech synthesis system. The new system, called “HTS-2007,” employs speaker adaptation (CSMAPLR+MAP), feature-space adaptive training, mixed-gender modeling, and full-covariance modeling using CSMAPLR transforms, in addition to several other techniques that have proved effective in our previous systems. Subjective evaluation results show that the new system generates significantly better quality synthetic speech than speaker-dependent approaches with realistic amounts of speech data, and that it bears comparison with speaker-dependent approaches even when large amounts of speech data are available. In addition, a comparison study with several speech synthesis techniques shows the new system is very robust: It is able to build voices from less-than-ideal speech data and synthesize good-quality speech even for out-of-domain sentences.

机译：本文介绍了一种基于说话者自适应的基于HMM的语音合成系统。新系统称为“ HTS-2007”，除了已证明有效的其他几种技术之外，还采用了说话人自适应（CSMAPLR + MAP），特征空间自适应训练，混合性别建模和使用CSMAPLR变换的全协方差建模。在我们以前的系统中。主观评估结果表明，与具有真实语音数据量的基于说话者的方法相比，新系统产生的合成语音质量要好得多，并且即使有大量语音数据，它也可以与基于说话者的方法进行比较。此外，与几种语音合成技术的比较研究表明，该新系统非常健壮：它能够从不理想的语音数据中构建语音，甚至可以针对域外句子合成高质量的语音。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2009年第6期|p.1208-1230|共23页
作者
Yamagishi J.; Nose T.; Zen H.; Ling Z.-H.; Toda T.; Tokuda K.; King S.; Renals S.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Average voice; HMM Speech Synthesis System, HTS; HMM-based speech synthesis; speaker adaptation; speech synthesis; voice conversion;

机译：平均语音;HMM语音合成系统;HTS;基于HMM的语音合成;说话人自适应;语音合成;语音转换;

相似文献

外文文献
中文文献
专利

1. Prosody Correction Preserving Speaker Individuality for Chinese-Accented Japanese HMM-Based Text-to-Speech Synthesis [J] . Daiki SEKIZAWA, Shinnosuke TAKAMICHI, Hiroshi SARUWATARI IEICE transactions on information and systems . 2019,第6期

机译：基于汉字的日本HMM语音合成中保留韵律校正的说话人个性
2. Incorporating a Mixed Excitation Model and Postfilter into HMM-Based Text-to-Speech Synthesis [J] . Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Systems and Computers in Japan . 2005,第12期

机译：在基于HMM的文本到语音合成中纳入混合激励模型和后置滤波器
3. Design of co-processor for real-time HMM-based text-to-speech on hardware system applied to Vietnamese [J] . Cong-Kha Pham, Duc-Hung Le, Hieu-Binh Nguyen, IEICE Electronics Express . 2015,第14期

机译：基于实时HMM的越南语硬件语音转语音协处理器设计
4. Roles of the Average Voice in Speaker-adaptive HMM-based Speech Synthesis [C] . Junichi Yamagishi, Oliver Watts, Simon King, Annual conference of the International Speech Communication Association;INTERSPEECH 2010 . 2011

机译：平均语音在基于说话者自适应HMM的语音合成中的作用
5. Text-to-Speech Synthesis Using Found Data for Low-Resource Languages [D] . Cooper, Erica 2019

机译：使用低资源语言的数据对文本进行语音合成
6. Robust Magnetized Graphene Oxide Platform for In Situ Peptide Synthesis and FRET-Based Protease Detection [O] . Seongsoo Kim, Sang-Myung Lee, Je Pil Yoon, 2020

机译：用于原位肽合成和基于FRET的蛋白酶检测的鲁棒磁化石墨烯氧化物平台
7. Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis [O] . Yamagishi, J., Nose, T., Zen, H., 2009

机译：基于鲁棒的说话人自适应Hmm的文本到语音合成

Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅