Adaptation of Prosody in Speech Synthesis by Changing Command Values of the Generation Process Model of Fundamental Frequency

机译：通过更改基本频率的生成过程模型的命令值来调整韵律在语音合成中的作用

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A method was developed to adapt prosody to a new speaker/style in speech synthesis. It is based on predicting differences between target and original speakers/styles and applying them to the original one. Differences in fundamental frequency (F_0) contours are represented in the framework of the generation process model; differences in the command magnitudes/amplitudes. While the original one requires a certain amount of training corpus, while corpus for training command differences can be small. Furthermore, in the case of style adaptation, it is not necessarily the corpus being uttered by the same speaker of the original style. Speech synthesis was conducted using HMM-based speech synthesis system, where prosody was controlled by the method. Listening experiments on synthetic speech with style adaptation and voice conversion both showed the validity of the method.

机译：开发了一种使韵律适应语音合成中新的说话者/风格的方法。它基于预测目标说话者/原始说话者/样式之间的差异，并将其应用于原始说话者/样式。在生成过程模型的框架中表示了基本频率（F_0）轮廓的差异。命令幅度/幅度的差异。虽然最初的一个需要一定数量的训练语料，但是用于训练命令的语料差异可能很小。此外，在样式适应的情况下，语料库不一定是由原始样式的同一说话者说出的。使用基于HMM的语音合成系统进行语音合成，其中通过该方法控制韵律。带有风格适应和语音转换的合成语音的听力实验都证明了该方法的有效性。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2011》|2011年|p.2804-2807|共4页
会议地点
作者
Keikichi Hirose; Keiko Ochi; Ryusuke Mihara; Hiroya Hashimoto; Daisuke Saito; Nobuaki Minematsu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
prosody adaptation; generation process model; speech synthesis;

机译：韵律适应生成过程模型;语音合成;

相似文献

外文文献
中文文献
专利

1. Applying generation process model constraint to fundamental frequency contours generated by hidden-Markov-model-based speech synthesis [J] . Tetsuya Matsuda, Keikichi Hirose, Nobuaki Minematsu Acoustical science and technology . 2012,第4期

机译：将生成过程模型约束应用于基于隐马尔可夫模型的语音合成生成的基频轮廓
2. Corpus-based generation of prosodic features for emotional speech synthesis based on a generation process model and its evaluation [J] . Toshiya Katsura, Keikichi Hirose, Nobuaki Minematsu 電子情報通信学会技術研究報告. 音声. Speech . 2002,第749期

机译：基于生成过程模型的基于语料库的语音特征韵律合成
3. Corpus-based generation of prosodic features for emotional speech synthesis based on a generation process model and its evaluation [J] . Toshiya Katsura, Keikichi Hirose, Nobuaki Minematsu 電子情報通信学会技術研究報告. 音声. Speech . 2002,第749期

机译：基于语料库的基于生成过程模型的情绪语音合成的韵律特征及其评价
4. Improved Automatic Extraction of Generation Process Model Commands and Its use for Generating Fundamental Frequency Contours for Training HMM-based Speech Synthesis [C] . Hiroya Hashimoto, Keikichi Hirose, Nobuaki Minematsu Annual conference of the International Speech Communication Association . 2012

机译：改进的生成过程模型命令的自动提取及其在生成用于训练基于HMM的语音合成的基本频率轮廓中的用途
5. Speech processing and modeling using a non-linear time-frequency algorithm. [D] . McNamara, David M. 2008

机译：使用非线性时频算法进行语音处理和建模。
6. Unsupervised Adaptation of Categorical Prosody Models for Prosody Labeling and Speech Recognition [O] . Sankaranarayanan Ananthakrishnan, Shrikanth Narayanan -1

机译：类别韵律模型的无监督适应用于韵律标记和语音识别
7. Applying generation process model constraint to fundamental frequency contours generated by hidden-Markov-model-based speech synthesis [O] . Tetsuya Matsuda, Keikichi Hirose, Nobuaki Minematsu 2012

机译：基于Mark-Markov模型的语音合成应用生成过程模型约束

Adaptation of Prosody in Speech Synthesis by Changing Command Values of the Generation Process Model of Fundamental Frequency

摘要

著录项

相似文献

相关主题

期刊订阅