Representing fundamental frequency contours generated by HMM-based speech synthesis using generation process model

Abstract

Frame-by-frame representation is not appropriate for prosodic features, which are tightly related to speech units spanning a wide time range, such as words and phrases. This causes an inherent problem in fundamental frequency (F0) contour generation by HMM-based speech synthesis. A method is developed to modify F0 contours in the framework of a generation process model by referring to linguistic information of the input text (word boundaries and accent types). It takes the F0 variances obtained through HMM-based speech synthesis into account during the process. A listening experiment on synthetic speech shows that, on average, the method yields better quality than HMM-based speech synthesis alone. Since the generation process model can clearly relate its commands to linguistic (and para- and non-linguistic) information, the method has an additional advantage: changing speech styles and/or adding further information (such as emphasis) can be done easily by manipulating the commands.
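
The abstract does not spell out the generation process model itself. As a rough illustration only, the sketch below assumes the widely used Fujisaki formulation, in which log F0 is a baseline plus phrase components (impulse responses) and accent components (clipped step responses). The constants `ALPHA`, `BETA`, and `GAMMA`, all function names, and the command values are assumptions chosen for illustration, not parameters taken from the paper.

```python
import math

# Assumed constants (commonly quoted defaults, not taken from the paper).
ALPHA = 3.0   # natural angular frequency of the phrase control mechanism (1/s)
BETA = 20.0   # natural angular frequency of the accent control mechanism (1/s)
GAMMA = 0.9   # ceiling of the accent component


def phrase_response(t, alpha=ALPHA):
    """Impulse response of the phrase control mechanism (zero before onset)."""
    if t < 0.0:
        return 0.0
    return alpha * alpha * t * math.exp(-alpha * t)


def accent_response(t, beta=BETA, gamma=GAMMA):
    """Step response of the accent control mechanism, clipped at gamma."""
    if t < 0.0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)


def log_f0(t, base_hz, phrase_cmds, accent_cmds):
    """log F0 at time t: baseline + phrase components + accent components."""
    value = math.log(base_hz)
    for onset, magnitude in phrase_cmds:
        value += magnitude * phrase_response(t - onset)
    for onset, offset, amplitude in accent_cmds:
        value += amplitude * (accent_response(t - onset)
                              - accent_response(t - offset))
    return value


def f0_contour(duration_s, frame_s, base_hz, phrase_cmds, accent_cmds):
    """Sample the model on a fixed frame grid and return F0 in Hz."""
    n_frames = int(round(duration_s / frame_s)) + 1
    return [math.exp(log_f0(i * frame_s, base_hz, phrase_cmds, accent_cmds))
            for i in range(n_frames)]


# Hypothetical commands: one phrase command and one accent command.
phrase_cmds = [(0.0, 0.5)]          # (onset time in s, magnitude)
accent_cmds = [(0.3, 0.8, 0.4)]     # (onset in s, offset in s, amplitude)
contour = f0_contour(1.5, 0.005, 100.0, phrase_cmds, accent_cmds)

# "Emphasis" as a command manipulation: raise the accent amplitude only.
emphasized = f0_contour(1.5, 0.005, 100.0, phrase_cmds, [(0.3, 0.8, 0.6)])
```

Under these assumptions, emphasis is expressed by changing a single accent amplitude rather than by editing individual frames, which is the kind of command-level control the abstract points to.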

Bibliographic record

Source
2011 IEEE International Workshop on Machine Learning for Signal Processing | 2011 | pp. 1-6 | 6 pages
Authors
Hirose Keikichi; Matsuda Tatsuya; Hashimoto Hiroya; Minematsu Nobuaki
Original format
PDF
Chinese Library Classification
Signal processing
Keywords
HMM-based speech synthesis; flexible control; fundamental frequency contour; generation process model; linguistic information
