首页> 外文会议>National Conference on Communications >Discriminating High Arousal and Low Arousal Emotional Speech Using Mahalanobis Distance Among Acoustic Features
【24h】

Discriminating High Arousal and Low Arousal Emotional Speech Using Mahalanobis Distance Among Acoustic Features

机译:利用马哈拉诺比斯距离在听觉特征中区分高听觉和低听情绪

获取原文

摘要

Emotion classification from emotional speech continues to be a challenging research domain. Few research studies have attempted to discriminate amongst a set of emotions, and categorize for valence, activation and dominance. Discriminating between high-arousal and low-arousal emotions is itself challenging, but discriminating emotions within each subcategory is further challenging problem. In this study, a new approach is proposed to discriminate between high and low arousal emotions, and also amongst emotions within each subcategory. Mahalanobis distances amongst acoustic feature vectors of emotional speech w.r.t. normal speech are examined. The approach, involving speech production features, has been validated on three databases: German (Berlin EMO-DB), English (RAVDESS) and Telugu (IITKGP-SESC). A common set of five emotions Angry, Happy, Fear, Disgust and Sad are examined with reference to normal speech. The vocal-tract filter features Mel-frequency cepstral coefficients (MFCCs), and combined source-filter features signal energy, zero-crossing rate and duration are used. A 2D projection of Mahalanobis distance for one emotion, w.r.t. normal, onto another emotion is observed to discriminate amongst emotions within each high/low-arousal sub-category. The Angry and Happy emotions are discriminated in high-arousal emotions sub-category, whereas Fear, Disgust and Sad are discriminated in low-arousal emotions sub-category. This study should be helpful in further classifying emotions within each subcategory of high/low arousal emotions in emotional speech.
机译:从情感言语进行情感分类仍然是一个具有挑战性的研究领域。很少有研究试图在一组情绪中进行区分,并对其价,激活和优势进行分类。区分高情绪和低情绪本身是一项挑战,但是区分每个子类别中的情绪则是另一个难题。在这项研究中,提出了一种新的方法来区分高和低觉醒情绪,以及每个子类别中的情绪。 w.r.t.情感语音的声学特征向量之间的Mahalanobis距离检查正常的言语。该方法涉及语音产生功能,已在三个数据库中得到验证:德语(柏林EMO-DB),英语(RAVDESS)和泰卢固语(IITKGP-SESC)。参照正常的言语,研究了一组常见的五种情绪,分别是:愤怒,快乐,恐惧,厌恶和悲伤。声道滤波器具有梅尔频率倒谱系数(MFCC),组合的源滤波器具有信号能量,过零率和持续时间。马哈拉诺比斯距离的2D投影对一种情感的影响正常情况下,观察到另一种情绪,以区分每个高/低唤醒子类别中的情绪。 “愤怒”和“快乐”情感在高情感类别中被区分,而“恐惧”,“厌恶”和“悲伤”在低情感类别中被区分。这项研究应有助于进一步将情绪言语中高/低唤醒情绪的每个子类别中的情绪分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号