Representing fundamental frequency contours generated by HMM-based speech synthesis using generation process model

Abstract

Frame-by-frame representation is not appropriate for prosodic features, which are closely tied to speech units that span long stretches of time, such as words and phrases. This causes an inherent problem in fundamental frequency (F0) contour generation by HMM-based speech synthesis. A method is developed to modify F0 contours within the framework of a generation process model by referring to linguistic information of the input text (word boundaries and accent types). The method takes the F0 variances obtained through HMM-based speech synthesis into account during the process. A listening experiment on synthetic speech showed that, on average, the method yields better quality than HMM-based speech synthesis. Since the generation process model clearly relates its commands to linguistic (and para-/non-linguistic) information, the method has an additional advantage: changing speech styles and/or adding further information (such as emphasis) can be done easily by manipulating the commands.
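As a rough illustration of the idea in the abstract, the sketch below assumes that the "generation process model" is the command-response (Fujisaki) model, in which log F0 is a baseline plus phrase components (impulse responses) and accent components (step responses). The class and function names, the command values, and the constants (alpha = 3.0, beta = 20.0, gamma = 0.9, commonly used defaults) are assumptions for illustration and are not taken from the paper. The inverse-variance weighting in `weighted_error` is only one plausible way to take F0 variances into account; the abstract does not say how the method uses them.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence

# Conventional constants for the command-response model (assumed values).
ALPHA = 3.0   # natural angular frequency of the phrase control mechanism (1/s)
BETA = 20.0   # natural angular frequency of the accent control mechanism (1/s)
GAMMA = 0.9   # ceiling level of the accent component


@dataclass
class PhraseCommand:
    t0: float  # onset time of the impulse-like phrase command (s)
    ap: float  # magnitude of the phrase command


@dataclass
class AccentCommand:
    t1: float  # onset time of the step-like accent command (s)
    t2: float  # offset time of the accent command (s)
    aa: float  # amplitude of the accent command


def phrase_response(t: float) -> float:
    """Impulse response of the phrase control mechanism (zero before t = 0)."""
    return ALPHA * ALPHA * t * math.exp(-ALPHA * t) if t >= 0.0 else 0.0


def accent_response(t: float) -> float:
    """Step response of the accent control mechanism, clipped at GAMMA."""
    if t < 0.0:
        return 0.0
    return min(1.0 - (1.0 + BETA * t) * math.exp(-BETA * t), GAMMA)


def log_f0(t: float, fb: float,
           phrases: Sequence[PhraseCommand],
           accents: Sequence[AccentCommand]) -> float:
    """Natural log of F0 at time t: baseline plus phrase and accent components."""
    value = math.log(fb)
    for p in phrases:
        value += p.ap * phrase_response(t - p.t0)
    for a in accents:
        value += a.aa * (accent_response(t - a.t1) - accent_response(t - a.t2))
    return value


def weighted_error(target: Sequence[float],
                   variances: Sequence[float],
                   model: Sequence[float]) -> float:
    """Squared log-F0 error with each frame weighted by 1 / variance (assumed scheme)."""
    return sum((y - m) ** 2 / v for y, v, m in zip(target, variances, model))


def contour(times: Sequence[float], fb: float,
            phrases: Sequence[PhraseCommand],
            accents: Sequence[AccentCommand]) -> List[float]:
    """Sample the model's log-F0 contour at the given times."""
    return [log_f0(t, fb, phrases, accents) for t in times]


# Hypothetical commands for a short utterance, sampled every 5 ms.
times = [0.005 * i for i in range(200)]
phrases = [PhraseCommand(t0=0.0, ap=0.5)]
accents = [AccentCommand(t1=0.3, t2=0.6, aa=0.4)]
neutral = contour(times, 120.0, phrases, accents)

# Emphasis expressed purely as a command change: scale the accent amplitude.
emphatic_accents = [AccentCommand(a.t1, a.t2, a.aa * 1.5) for a in accents]
emphatic = contour(times, 120.0, phrases, emphatic_accents)
```

The last lines show why a command-level representation is convenient: emphasis is added by editing one amplitude, and the whole contour changes consistently, with no frame-by-frame edits.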

Bibliographic record

Source
IEEE International Workshop on Machine Learning for Signal Processing | 2011 | 6 pages
Authors
Hirose Keikichi; Matsuda Tatsuya; Hashimoto Hiroya; Minematsu Nobuaki
Original format: PDF
Chinese Library Classification: TN911.7-53
Keywords
HMM-based speech synthesis; flexible control; fundamental frequency contour; generation process model; linguistic information

Similar literature

1. Applying generation process model constraint to fundamental frequency contours generated by hidden-Markov-model-based speech synthesis [J]. Tetsuya Matsuda, Keikichi Hirose, Nobuaki Minematsu. Acoustical science and technology. 2012, No. 4
2. F_0 contour generation and synthesis using Bengali HMM-based speech synthesis system [J]. Sankar Mukherjee, Shyamal Kumar Das Mandal. International journal of speech technology. 2015, No. 1
3. A Control of Fundamental Frequency Contour for Hidden Markov Model-Based Thai Speech Synthesis [J]. Suphattharachai Chomphan. American journal of applied sciences. 2012, No. 2
4. Representing fundamental frequency contours generated by HMM-based speech synthesis using generation process model [C]. Hirose Keikichi, Matsuda Tatsuya, Hashimoto Hiroya. 2011 IEEE International Workshop on Machine Learning for Signal Processing. 2011
5. The Effects of Fundamental Frequency Contours on the Intelligibility Benefit of Clear Speech in Native Speakers of American English and Native Speakers of Seoul Korean [D]. Han, Heekyung. 2019
6. Effects of Semantic Context and Fundamental Frequency Contours on Mandarin Speech Recognition by Second Language Learners [O]. Linjun Zhang, Yu Li, Han Wu
7. Applying generation process model constraint to fundamental frequency contours generated by hidden-Markov-model-based speech synthesis [O]. Tetsuya Matsuda, Keikichi Hirose, Nobuaki Minematsu. 2012
