...
首页> 外文期刊>IEICE transactions on information and systems >A Technique for Estimating Intensity of Emotional Expressions and Speaking Styles in Speech Based on Multiple-Regression HSMM
【24h】

A Technique for Estimating Intensity of Emotional Expressions and Speaking Styles in Speech Based on Multiple-Regression HSMM

机译:基于多元回归HSMM的语音情感表达强度和说话风格估计技术

获取原文
           

摘要

In this paper, we propose a technique for estimating the degree or intensity of emotional expressions and speaking styles appearing in speech. The key idea is based on a style control technique for speech synthesis using a multiple regression hidden semi-Markov model (MRHSMM), and the proposed technique can be viewed as the inverse of the style control. In the proposed technique, the acoustic features of spectrum, power, fundamental frequency, and duration are simultaneously modeled using the MRHSMM. We derive an algorithm for estimating explanatory variables of the MRHSMM, each of which represents the degree or intensity of emotional expressions and speaking styles appearing in acoustic features of speech, based on a maximum likelihood criterion. We show experimental results to demonstrate the ability of the proposed technique using two types of speech data, simulated emotional speech and spontaneous speech with different speaking styles. It is found that the estimated values have correlation with human perception.
机译:在本文中,我们提出了一种用于估计语音中出现的情感表达和说话方式的程度或强度的技术。关键思想是基于使用多重回归隐藏半马尔可夫模型(MRHSMM)进行语音合成的样式控制技术,并且可以将所提出的技术视为样式控制的逆向。在提出的技术中,使用MRHSMM同时对频谱,功率,基频和持续时间的声学特征进行建模。我们推导了一种算法,用于估计MRHSMM的解释变量,该算法基于最大似然准则来表示情绪表达的程度或强度以及出现在语音声学特征中的说话风格。我们显示了实验结果,以证明使用两种类型的语音数据(模拟的情感语音和具有不同语音风格的自发语音)提出的技术的能力。发现估计值与人类感知相关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号