首页> 外文会议>IEEE Winter Conference on Applications of Computer Vision >LUCFER: A Large-Scale Context-Sensitive Image Dataset for Deep Learning of Visual Emotions
【24h】

LUCFER: A Large-Scale Context-Sensitive Image Dataset for Deep Learning of Visual Emotions

机译:LUCIFER:用于深度学习视觉情感的大规模上下文相关图像数据集

获取原文

摘要

Still image emotion recognition has been receiving increasing attention in recent years due to the tremendous amount of social media content available on the Web. Opinion mining, visual emotion analysis, search and retrieval are among the application areas, to name a few. While there exist works on the subject, offering methods to detect image sentiment; i.e. recognizing the polarity of the image, less efforts focus on emotion analysis; i.e. dealing with recognizing the exact emotion aroused when exposed to certain visual stimuli. Main gaps tackled in this work include (1) lack of large-scale image datasets for deep learning of visual emotions and (2) lack of context-sensitive single-modality approaches in emotion analysis in the still image domain. In this paper, we introduce LUCFER (Pronounced LU-CI-FER), a dataset containing over 3.6M images, with 3-dimensional labels; i.e. emotion, context and valence. LUCFER, the largest dataset of the kind currently available, is collected using a novel data collection pipeline, proposed and implemented in this work. Moreover, we train a context-sensitive deep classifier using a novel multinomial classification technique proposed here via adding a dimensionality reduction layer to the CNN. Relying on our categorical approach to emotion recognition, we claim and show empirically that injecting context to our unified training process helps (1) achieve a more balanced precision and recall, and (2) boost performance, yielding an overall classification accuracy of 73.12% compared to 58.3% achieved in the closest work in the literature.
机译:近年来,由于网络上可用的大量社交媒体内容,静止图像情感识别已受到越来越多的关注。观点挖掘,视觉情感分析,搜索和检索等都是应用领域,仅举几例。尽管存在有关该主题的作品,但提供了检测图像情感的方法;即,识别图像的极性,较少的精力集中在情感分析上;即处理识别暴露于某些视觉刺激时引起的确切情绪。这项工作解决的主要差距包括:(1)缺乏用于深度学习视觉情感的大规模图像数据集;(2)在静止图像领域的情感分析中缺乏上下文相关的单模态方法。在本文中,我们介绍LUCFER(发音为LU-CI-FER),该数据集包含超过360万张图像,并带有3维标签;即情感,背景和效价。 LUCFER是目前可用的最大的数据集,它是使用新颖的数据收集管道收集的,该数据收集管道是在这项工作中提出并实施的。此外,我们通过在CNN上添加降维层,使用此处提出的新型多项式分类技术训练上下文相关的深度分类器。依靠我们对情感识别的分类方法,我们声称并凭经验表明,将上下文注入我们的统一训练过程有助于(1)达到更加平衡的精度和召回率,以及(2)提高性能,总体分类精度为73.12 \%与文献中最接近的文献中的58.3%相比。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号