Conference: IEEE International Conference on Multimedia and Expo

The Performance of the Speaking Rate Parameter in Emotion Recognition from Speech



Abstract

Speaking rate is a salient prosodic characteristic of speech, and humans can easily estimate how fast an interlocutor is talking. Moreover, a person's emotional disposition is strongly expressed in his or her speaking rate. In this paper we investigate the performance gain obtained by using the speaking rate parameter in emotion recognition from speech. Speaking rates are determined by applying a broad phonetic class recognizer. The classifier is trained on cepstral features extracted from the emotionally neutral RM1 speech corpus and provides a low average recognition error of one phoneme/second. We present the results of an empirical study on the emotionally expressive Emo-DB corpus using a neural network classifier and demonstrate the significant influence of speaking rate on emotion classification. The performance of Multi-Layer Perceptrons trained on cepstral turn-level features is analyzed with and without the speaking rate feature. An increase in accuracy of up to 3.7% is reported for certain emotion categories.
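
As a rough illustration of the idea in the abstract, the sketch below derives a speaking rate in phonemes per second from a time-aligned sequence of broad phonetic class segments and appends it to a turn-level feature vector. It is not the authors' implementation: the `Segment` type, the `"sil"` silence label, the class names in the example, and the choice to divide by the full turn duration (pauses included) are all assumptions made for illustration.

```python
from dataclasses import dataclass

# Hypothetical silence label; the abstract does not list the class inventory.
SILENCE = "sil"


@dataclass
class Segment:
    label: str    # broad phonetic class label emitted by a recognizer
    start: float  # segment start time in seconds
    end: float    # segment end time in seconds


def speaking_rate(segments, silence_label=SILENCE):
    """Return non-silence segments per second over the whole turn.

    Assumption: the turn duration runs from the start of the first
    segment to the end of the last one, pauses included.
    """
    speech = [s for s in segments if s.label != silence_label]
    if not speech:
        return 0.0
    duration = segments[-1].end - segments[0].start
    if duration <= 0:
        return 0.0
    return len(speech) / duration


def append_rate(cepstral_features, segments):
    """Return a new feature vector with the speaking rate as the last entry."""
    return list(cepstral_features) + [speaking_rate(segments)]


if __name__ == "__main__":
    # Made-up alignment for a one-second turn.
    turn = [
        Segment("sil", 0.0, 0.2),
        Segment("vowel", 0.2, 0.4),
        Segment("fricative", 0.4, 0.5),
        Segment("vowel", 0.5, 0.8),
        Segment("sil", 0.8, 1.0),
    ]
    print(speaking_rate(turn))
    print(append_rate([0.1, 0.2], turn))
```

Comparing a classifier trained on the vector with and without the final entry mirrors the with/without comparison described in the abstract, although the actual features and network configuration are not given there.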
