首页> 外文会议>International Conference on Affective Computing and Intelligent Interaction >Leveraging Inter-rater Agreement for Audio-Visual Emotion Recognition
【24h】

Leveraging Inter-rater Agreement for Audio-Visual Emotion Recognition

机译:利用帧内视听情感认可的帧间协议

获取原文

摘要

Human expressions are often ambiguous and unclear, resulting in disagreement or confusion among different human evaluators. In this paper, we investigate how audio-visual emotion recognition systems can leverage prototypicality, the level of agreement or confusion among human evaluators. We propose the use of a weighted Support Vector Machine to explicitly model the relationship between the prototypicality of training instances and evaluated emotion from the IEMOCAP corpus. We choose weights of prototypical and non-prototypical instances based on the maximal accuracy of each speaker. We then provide per-speaker analysis to understand specific speech characteristics associated with the information gain of emotion given prototypicality information. Our experimental results show that neutrality, one of the most challenging emotion to recognize, has the highest performance gain from prototypicality information, compared to other emotion classes: Angry, Happy, and Sad. We also show that the proposed method improves the overall multi-class classification accuracy significantly over traditional methods that do not leverage prototypicality.
机译:人类表达通常是暧昧和不清楚的,导致不同人类评估人员之间的分歧或混乱。在本文中,我们调查了视听情感识别系统如何利用原型,人类评估人员之间的协议水平或混淆。我们建议使用加权支持向量机来明确地模拟培训实例的原型与Iemocap语料库的情感之间的关系。我们根据每个扬声器的最大精度选择原型和非原型实例的重量。然后,我们提供每个扬声器分析以了解与情感的信息增益相关联的特定语音特征。我们的实验结果表明,与其他情感课程相比,来自识别的最具挑战性的情感中的中立性,具有最高的性能信息:愤怒,快乐,悲伤。我们还表明,通过不利用原型的传统方法显着提高了整体多级分类准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号