Context Modeling for Cross-Corpus Dimensional Acoustic Emotion Recognition: Challenges and Mixup

机译：跨企业维声学情感识别的上下文建模：挑战和混淆。

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently, focus of research in the field of affective computing was shifted to spontaneous interactions and time-continuous annotations. Such data enlarge the possibility for real-world emotion recognition in the wild, but also introduce new challenges. Affective computing is a research area, where data collection is not a trivial and cheap task; therefore it would be rational to use all the data available. However, due to the subjective nature of emotions, differences in cultural and linguistic features as well as environmental conditions, combining affective speech data is not a straightforward process. In this paper, we analyze difficulties of automatic emotion recognition in time-continuous, dimensional scenario using data from RECOLA, SEMAINE and Cre-ativeIT databases. We propose to employ a simple but effective strategy called "mixup" to overcome the gap in feature-target and target-target covariance structures across corpora. We showcase the performance of our system in three different cross-corpus experimental setups: single-corpus training, two-corpora training and training on augmented (mixed up) data. Findings show that the prediction behavior of trained models heavily depends on the covariance structure of the training corpus, and mixup is very effective in improving cross-corpus acoustic emotion recognition performance of context dependent LSTM models.

机译：最近，情感计算领域的研究重点转移到了自发交互和时间连续注释上。这些数据扩大了在野外进行现实世界中情感识别的可能性，但同时也带来了新的挑战。情感计算是一个研究领域，数据收集并不是一项琐碎而廉价的任务。因此，使用所有可用数据将是合理的。但是，由于情感的主观性质，文化和语言特征以及环境条件的差异，组合情感语音数据并不是一个简单的过程。在本文中，我们使用RECOLA，SEMAINE和Cre-ativeIT数据库中的数据分析了在时间连续的维度场景中自动情感识别的困难。我们建议采用一种简单但有效的策略，称为“混合”，以克服整个语料库中特征-目标和目标-目标协方差结构中的差距。我们在三种不同的跨主体实验设置中展示了我们系统的性能：单主体训练，两主体训练和增强（混合）数据训练。研究结果表明，训练模型的预测行为在很大程度上取决于训练语料库的协方差结构，并且混合对于提高上下文相关LSTM模型的跨语料库声情感识别性能非常有效。

著录项

来源
《International Conference on speech and computer》|2018年|155-165|共11页
会议地点
作者
Dmitrii Fedotov; Heysem Kaya; Alexey Karpov;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Cross-corpus emotion recognition Time-continuous emotion recognition; Data augmentation;

机译：跨主体情感识别时间连续的情感识别;资料扩充;

相似文献

外文文献
中文文献
专利

1. Cross-Corpus Acoustic Emotion Recognition with Multi-Task Learning: Seeking Common Ground While Preserving Differences [J] . Zhang Biqiao, Provost Emily Mower, Essl Georg Affective Computing, IEEE Transactions on . 2019,第1期

机译：具有多任务学习的跨Corpus声学情感识别：在保持差异的同时寻求共同点
2. Cross-Corpus Acoustic Emotion Recognition with Multi-Task Learning: Seeking Common Ground While Preserving Differences [J] . Zhang Biqiao, Provost Emily Mower, Essl Georg Affective Computing, IEEE Transactions on . 2019,第1期

机译：交叉语料库声学情感认同与多任务学习：在保持差异的同时寻求共同点
3. Efficient and effective strategies for cross-corpus acoustic emotion recognition [J] . Kaya Heysem, Karpov Alexey A. Neurocomputing . 2018,第JANa31期

机译：跨主体声学情感识别的有效策略
4. Context Modeling for Cross-Corpus Dimensional Acoustic Emotion Recognition: Challenges and Mixup [C] . Dmitrii Fedotov, Heysem Kaya, Alexey Karpov International Conference on Speech and Computer . 2018

机译：Cross-Corpus尺寸声学情感识别的上下文建模：挑战和混合
5. Three-dimensional generic elastic models for two-dimensional pose synthesis and face recognition. [D] . Heo, Jingu. 2010

机译：用于二维姿态合成和面部识别的三维通用弹性模型。
6. Recognition of Emotions in Mexican Spanish Speech: An Approach Based on Acoustic Modelling of Emotion-Specific Vowels [O] . Santiago-Omar Caballero-Morales 2013

机译：墨西哥西班牙语语音中的情绪识别：一种基于情绪特定元音声学模型的方法
7. Unsupervised Learning in Cross-Corpus Acoustic Emotion Recognition [O] . Zixing Zhang, Felix Weninger, Martin Wöllmer, 2011

机译：跨Corpus声学情感识别中的无监督学习

Context Modeling for Cross-Corpus Dimensional Acoustic Emotion Recognition: Challenges and Mixup

摘要

著录项

相似文献

相关主题

期刊订阅