首页> 外文会议>International Conference on speech and computer >Context Modeling for Cross-Corpus Dimensional Acoustic Emotion Recognition: Challenges and Mixup
【24h】

Context Modeling for Cross-Corpus Dimensional Acoustic Emotion Recognition: Challenges and Mixup

机译:跨企业维声学情感识别的上下文建模:挑战和混淆。

获取原文

摘要

Recently, focus of research in the field of affective computing was shifted to spontaneous interactions and time-continuous annotations. Such data enlarge the possibility for real-world emotion recognition in the wild, but also introduce new challenges. Affective computing is a research area, where data collection is not a trivial and cheap task; therefore it would be rational to use all the data available. However, due to the subjective nature of emotions, differences in cultural and linguistic features as well as environmental conditions, combining affective speech data is not a straightforward process. In this paper, we analyze difficulties of automatic emotion recognition in time-continuous, dimensional scenario using data from RECOLA, SEMAINE and Cre-ativeIT databases. We propose to employ a simple but effective strategy called "mixup" to overcome the gap in feature-target and target-target covariance structures across corpora. We showcase the performance of our system in three different cross-corpus experimental setups: single-corpus training, two-corpora training and training on augmented (mixed up) data. Findings show that the prediction behavior of trained models heavily depends on the covariance structure of the training corpus, and mixup is very effective in improving cross-corpus acoustic emotion recognition performance of context dependent LSTM models.
机译:最近,情感计算领域的研究重点转移到了自发交互和时间连续注释上。这些数据扩大了在野外进行现实世界中情感识别的可能性,但同时也带来了新的挑战。情感计算是一个研究领域,数据收集并不是一项琐碎而廉价的任务。因此,使用所有可用数据将是合理的。但是,由于情感的主观性质,文化和语言特征以及环境条件的差异,组合情感语音数据并不是一个简单的过程。在本文中,我们使用RECOLA,SEMAINE和Cre-ativeIT数据库中的数据分析了在时间连续的维度场景中自动情感识别的困难。我们建议采用一种简单但有效的策略,称为“混合”,以克服整个语料库中特征-目标和目标-目标协方差结构中的差距。我们在三种不同的跨主体实验设置中展示了我们系统的性能:单主体训练,两主体训练和增强(混合)数据训练。研究结果表明,训练模型的预测行为在很大程度上取决于训练语料库的协方差结构,并且混合对于提高上下文相关LSTM模型的跨语料库声情感识别性能非常有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号