IEEE Transactions on Affective Computing

CREMA-D: Crowd-Sourced Emotional Multimodal Actors Dataset



Abstract

People convey their emotional state in their face and voice. We present an audio-visual dataset uniquely suited for the study of multi-modal emotion expression and perception. The dataset consists of facial and vocal emotional expressions in sentences spoken in a range of basic emotional states (happy, sad, anger, fear, disgust, and neutral). 7,442 clips of 91 actors with diverse ethnic backgrounds were rated by multiple raters in three modalities: audio, visual, and audio-visual. Categorical emotion labels and real-valued intensity ratings for the perceived emotion were collected via crowd-sourcing from 2,443 raters. Human recognition of the intended emotion for the audio-only, visual-only, and audio-visual data is 40.9, 58.2, and 63.6 percent, respectively. Recognition rates are highest for neutral, followed by happy, anger, disgust, fear, and sad. Average intensity levels of emotion are rated highest for visual-only perception. The accurate recognition of disgust and fear requires simultaneous audio-visual cues, while anger and happiness can be well recognized based on evidence from a single modality. The large dataset we introduce can be used to probe other questions concerning the audio-visual perception of emotion.
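The abstract describes collecting categorical emotion labels from multiple crowd-sourced raters per clip and modality. As a minimal illustrative sketch (not the dataset's documented aggregation procedure), one common way to derive a single perceived-emotion label per clip is a majority vote over rater responses; the tie-breaking rule below is an assumption for illustration:

```python
from collections import Counter

# The six basic emotional states used in CREMA-D.
EMOTIONS = ("happy", "sad", "anger", "fear", "disgust", "neutral")

def majority_label(ratings):
    """Return the most frequent emotion label among a clip's raters.

    Ties are broken by the order emotions appear in EMOTIONS; this
    tie-breaking rule is a hypothetical choice for illustration, not
    the dataset's documented procedure.
    """
    counts = Counter(ratings)
    best = max(counts.values())
    # Deterministic tie-break: first emotion in EMOTIONS with the max count.
    for emotion in EMOTIONS:
        if counts.get(emotion, 0) == best:
            return emotion

# Example: five hypothetical raters judging one audio-only clip.
print(majority_label(["anger", "anger", "disgust", "anger", "neutral"]))  # anger
```

Per-modality recognition rates such as those reported (40.9, 58.2, and 63.6 percent) can then be computed as the fraction of clips whose aggregated perceived label matches the actor's intended emotion.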

