【24h】

Intonation Modelling and Adaptation for Emotional Prosody Generation

机译:情绪韵律生成的语调建模与适应

获取原文
获取原文并翻译 | 示例

摘要

This paper proposes an HMM-based approach to generating emotional intonation patterns. A set of models were built to represent syllable-length intonation units. In a classification framework, the models were able to detect a sequence of intonation units from raw fundamental frequency values. Using the models in a generative framework, we were able to synthesize smooth and natural sounding pitch contours. As a case study for emotional intonation generation, Maximum Likelihood Linear Regression (MLLR) adaptation was used to transform the neutral model parameters with a small amount of happy and sad speech data. Perceptual tests showed that listeners could identify the speech with the sad intonation 80% of the time. On the other hand, listeners formed a bimodal distribution in their ability to detect the system generated happy intontation and on average listeners were able to detect happy intonation only 46% of the time.
机译:本文提出了一种基于HMM的方法来生成情绪语调模式。建立了一组表示音节长度音调单位的模型。在分类框架中,这些模型能够从原始基本频率值中检测出一系列语调单元。在生成框架中使用这些模型,我们能够合成平滑自然的音高等高线。作为情感语调生成的案例研究,使用最大似然线性回归(MLLR)自适应来转换带有少量快乐和悲伤语音数据的中性模型参数。知觉测试表明,听众可以在80%的时间内识别出悲伤的语调。另一方面,听众在检测系统产生的快乐音调的能力上形成了双峰分布,平均而言,听众仅能在46%的时间内检测到快乐语调。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号