Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops

The Performance of the Speaking Rate Parameter in Emotion Recognition from Speech

Abstract

The speaking rate is an obvious prosodic characteristic of speech, and humans can easily estimate how fast an interlocutor is talking. Further, a person's different emotional dispositions are strongly expressed in his/her speaking rate. In this paper we investigate the performance gain originating from the use of the speaking rate parameter in emotion recognition from speech. The speaking rates are determined by applying a broad phonetic class recognizer. The classifier is trained on cepstral features extracted from the emotionally neutral RM1 speech corpus and achieves a low average recognition error of one phoneme/second. We present the results of an empirical study on the emotionally expressive Emo-DB corpus using a neural network classifier and demonstrate the significant influence of the speaking rate on emotion classification. The performance of Multi-Layer Perceptrons trained on cepstral turn-level features is analyzed with and without the speaking rate feature. An increase in accuracy of up to 3.7% in certain emotion categories is reported.
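
As a rough illustration of the idea summarized above, the following minimal sketch computes a speaking rate in phonetic units per second from time-aligned recognizer output and appends it to a turn-level feature vector. The segment format, the `sil` silence label, the exclusion of silence from the duration, and the function names are all assumptions made for this example; the abstract does not specify how the authors compute or integrate the rate.

```python
from typing import List, Sequence, Tuple

# Hypothetical recognizer output: (broad phonetic class, start time in s, end time in s).
Segment = Tuple[str, float, float]


def speaking_rate(segments: Sequence[Segment], silence_label: str = "sil") -> float:
    """Return phonetic units per second of speech, ignoring silence segments.

    Excluding silence is an assumption of this sketch, not a detail from the paper.
    """
    speech = [seg for seg in segments if seg[0] != silence_label]
    duration = sum(end - start for _, start, end in speech)
    if duration <= 0:
        return 0.0  # no speech detected in this turn
    return len(speech) / duration


def append_rate(turn_features: Sequence[float], segments: Sequence[Segment]) -> List[float]:
    """Extend a turn-level feature vector (e.g. cepstral statistics) with the rate."""
    return list(turn_features) + [speaking_rate(segments)]


# Toy turn: four phonetic units spanning 0.5 s of speech after a leading silence.
segments = [
    ("sil", 0.0, 0.25),
    ("vowel", 0.25, 0.375),
    ("plosive", 0.375, 0.5),
    ("vowel", 0.5, 0.625),
    ("nasal", 0.625, 0.75),
]
rate = speaking_rate(segments)                 # 4 units / 0.5 s = 8.0
features = append_rate([0.1, -0.3], segments)  # [0.1, -0.3, 8.0]
```

A classifier such as a Multi-Layer Perceptron could then be trained once on the original feature vectors and once on the extended ones to compare accuracy with and without the extra feature.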