首页> 外文会议>IEEE Winter Conference on Applications of Computer Vision >LUCFER: A Large-Scale Context-Sensitive Image Dataset for Deep Learning of Visual Emotions
【24h】

LUCFER: A Large-Scale Context-Sensitive Image Dataset for Deep Learning of Visual Emotions

机译:Lucifer:一个大规模的上下文敏感图像数据集,用于深入学习视觉情绪

获取原文

摘要

Still image emotion recognition has been receiving increasing attention in recent years due to the tremendous amount of social media content available on the Web. Opinion mining, visual emotion analysis, search and retrieval are among the application areas, to name a few. While there exist works on the subject, offering methods to detect image sentiment; i.e. recognizing the polarity of the image, less efforts focus on emotion analysis; i.e. dealing with recognizing the exact emotion aroused when exposed to certain visual stimuli. Main gaps tackled in this work include (1) lack of large-scale image datasets for deep learning of visual emotions and (2) lack of context-sensitive single-modality approaches in emotion analysis in the still image domain. In this paper, we introduce LUCFER (Pronounced LU-CI-FER), a dataset containing over 3.6M images, with 3-dimensional labels; i.e. emotion, context and valence. LUCFER, the largest dataset of the kind currently available, is collected using a novel data collection pipeline, proposed and implemented in this work. Moreover, we train a context-sensitive deep classifier using a novel multinomial classification technique proposed here via adding a dimensionality reduction layer to the CNN. Relying on our categorical approach to emotion recognition, we claim and show empirically that injecting context to our unified training process helps (1) achieve a more balanced precision and recall, and (2) boost performance, yielding an overall classification accuracy of 73.12% compared to 58.3% achieved in the closest work in the literature.
机译:由于网络上可用的巨大社交媒体内容,近年来,仍然是图像的静态影像识别越来越受到关注。意见采矿,视觉情绪分析,搜索和检索是应用领域,名称为少。虽然存在在主题上有效,但提供了检测图像情绪的方法;即识别图像的极性,少努力关注情绪分析;即,在暴露某些视觉刺激时,应识别确切的情绪。在这项工作中解决的主要差距包括(1)缺乏大规模图像数据集,用于深入学习视觉情绪和(2)缺乏图像域中的情感分析中的情感敏感的单片机方法。在本文中,我们介绍了Lucfer(发音Lu-CI-FER),数据集包含超过3.6M图像,具有三维标签;即情绪,背景和价值。 Lucfer是当前可用的最大数据集,使用新颖的数据收集管道收集,在这项工作中提出和实施。此外,我们使用这里提出的新型多项分类技术训练上下文敏感的深度分类器,通过向CNN添加维数减少层。我们依靠我们的情感认可方法,我们声称并展示了向我们统一的培训过程注入上下文,有助于(1)实现更平衡的精度和召回,(2)提升性能,产生73.12 %的整体分类准确性与文献中最近的工作中实现的58.3 %相比。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号