A Technique for Estimating Intensity of Emotional Expressions and Speaking Styles in Speech Based on Multiple-Regression HSMM

Takashi NOSE; Takao KOBAYASHI

首页> 外文期刊>IEICE transactions on information and systems >A Technique for Estimating Intensity of Emotional Expressions and Speaking Styles in Speech Based on Multiple-Regression HSMM

【24h】

A Technique for Estimating Intensity of Emotional Expressions and Speaking Styles in Speech Based on Multiple-Regression HSMM

机译：基于多元回归HSMM的语音情感表达强度和说话风格估计技术

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a technique for estimating the degree or intensity of emotional expressions and speaking styles appearing in speech. The key idea is based on a style control technique for speech synthesis using a multiple regression hidden semi-Markov model (MRHSMM), and the proposed technique can be viewed as the inverse of the style control. In the proposed technique, the acoustic features of spectrum, power, fundamental frequency, and duration are simultaneously modeled using the MRHSMM. We derive an algorithm for estimating explanatory variables of the MRHSMM, each of which represents the degree or intensity of emotional expressions and speaking styles appearing in acoustic features of speech, based on a maximum likelihood criterion. We show experimental results to demonstrate the ability of the proposed technique using two types of speech data, simulated emotional speech and spontaneous speech with different speaking styles. It is found that the estimated values have correlation with human perception.

机译：在本文中，我们提出了一种用于估计语音中出现的情感表达和说话方式的程度或强度的技术。关键思想是基于使用多重回归隐藏半马尔可夫模型（MRHSMM）进行语音合成的样式控制技术，并且可以将所提出的技术视为样式控制的逆向。在提出的技术中，使用MRHSMM同时对频谱，功率，基频和持续时间的声学特征进行建模。我们推导了一种算法，用于估计MRHSMM的解释变量，该算法基于最大似然准则来表示情绪表达的程度或强度以及出现在语音声学特征中的说话风格。我们显示了实验结果，以证明使用两种类型的语音数据（模拟的情感语音和具有不同语音风格的自发语音）提出的技术的能力。发现估计值与人类感知相关。

著录项

来源
《IEICE transactions on information and systems》 |2010年第1期|共9页
作者
Takashi NOSE; Takao KOBAYASHI;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. A Technique for Estimating Intensity of Emotional Expressions and Speaking Styles in Speech Based on Multiple-Regression HSMM [J] . Takashi NOSE, Takao KOBAYASHI IEICE Transactions on Information and Systems . 2010,第1期

机译：基于多元回归HSMM的语音情感表达强度和说话风格估计技术
2. A Rapid Model Adaptation Technique for Emotional Speech Recognition with Style Estimation Based on Multiple-Regression HMM [J] . Yusuke IJIMA, Takashi NOSE, Makoto TACHIBANA, IEICE transactions on information and systems . 2010,第1期

机译：基于多元回归HMM的带样式估计的情感语音快速模型自适应技术
3. A Rapid Model Adaptation Technique for Emotional Speech Recognition with Style Estimation Based on Multiple-Regression HMM [J] . Yusuke IJIMA, Takashi NOSE, Makoto TACHIBANA, IEICE Transactions on Information and Systems . 2010,第1期

机译：基于多元回归HMM的带样式估计的情感语音快速模型自适应技术
4. An Estimation Technique of Style Expressiveness for Emotional SpeechUsing Model Adaptation Based on Multiple-Regression HSMM [C] . Takashi Nose, Yoichi Kato, Makoto Tachibana, International Speech Communication Association . 2008

机译：基于多元回归HSMM的情感言论模型适应风格表达估算技术
5. Perception and Production of Emotional Prosody in the Speech of Mandarin-Speaking Adults with Cochlear Implants [D] . Pak, Cecilia Liu. 2018

机译：普通话成年人与人工耳蜗的讲话中对情绪韵律的感知和产生
6. Effects of speaking style on speech intelligibility for Mandarin-speaking cochlear implant users [O] . Yongxin Li, Guoping Zhang, Hou-yong Kang, -1

机译：口语风格对讲普通话的人工耳蜗使用者语音清晰度的影响
7. A Technique for Estimating Intensity of Emotional Expressions and Speaking Styles in Speech Based on Multiple-Regression HSMM [O] . Takashi NOSE, Takao KOBAYASHI 2010

机译：基于多元回归HSMM的语音中情感表达强度估算情感表达强度的技术
8. Effects of Feature Type, Learning Algorithm and Speaking Style for Depression Detection from Speech. [R] . Mitra, V., Shriberg, E. 2015

机译：特征类型，学习算法和语音风格对语音抑郁检测的影响。

A Technique for Estimating Intensity of Emotional Expressions and Speaking Styles in Speech Based on Multiple-Regression HSMM

摘要

著录项

相似文献

相关主题

期刊订阅