AAAI Conference on Artificial Intelligence

Using Co-Captured Face, Gaze and Verbal Reactions to Images of Varying Emotional Content for Analysis and Semantic Alignment



Abstract

Analyzing different modalities of expression can provide insights into the ways that humans interpret, label, and react to images. Such insights have the potential not only to advance our understanding of how humans coordinate these expressive modalities but also to enhance existing methodologies for common AI tasks such as image annotation and classification. We conducted an experiment that co-captured the facial expressions, eye movements, and spoken language data that observers produce while examining images of varying emotional content and responding to description-oriented vs. affect-oriented questions about those images. We analyzed the facial expressions produced by the observers in order to determine the connection between those expressions and an image's emotional content. We also explored the relationship between the valence of an image and the verbal responses to that image, and how that relationship relates to the nature of the prompt, using low-level lexical features and more complex affective features extracted from the observers' verbal responses. Finally, in order to integrate this multimodal data, we extended an existing bitext alignment framework to create meaningful pairings between narrated observations about images and the image regions indicated by eye movement data. The resulting annotations of image regions with words from observers' responses demonstrate the potential of bitext alignment for multimodal data integration and, from an application perspective, for annotation of open-domain images. In addition, we found that while responses to affect-oriented questions appear useful for image understanding, their holistic nature seems less helpful for image region annotation.
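To make the word-to-region pairing described above concrete, below is a minimal, hypothetical Python sketch. It is not the authors' extended bitext alignment framework; it only illustrates the kind of pairing the abstract describes, assuming time-stamped gaze fixations over labeled image regions and a time-aligned transcript of the verbal response. The names Fixation, SpokenWord, and pair_words_with_regions, as well as the greedy temporal-overlap heuristic, are illustrative assumptions rather than details from the paper.

from dataclasses import dataclass

@dataclass
class Fixation:
    region: str    # label of the fixated image region, e.g. "dog"
    start: float   # fixation onset (seconds)
    end: float     # fixation offset (seconds)

@dataclass
class SpokenWord:
    word: str
    start: float   # word onset (seconds)
    end: float     # word offset (seconds)

def overlap(a_start, a_end, b_start, b_end):
    # Length of temporal overlap between two intervals; 0 if disjoint.
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))

def pair_words_with_regions(words, fixations):
    # Pair each spoken word with the image region whose fixation overlaps
    # it longest in time, or None if no fixation overlaps the word at all.
    pairs = []
    for w in words:
        best_region, best_overlap = None, 0.0
        for f in fixations:
            o = overlap(w.start, w.end, f.start, f.end)
            if o > best_overlap:
                best_region, best_overlap = f.region, o
        pairs.append((w.word, best_region))
    return pairs

if __name__ == "__main__":
    fixations = [Fixation("dog", 0.0, 1.2), Fixation("ball", 1.2, 2.5)]
    words = [SpokenWord("a", 0.1, 0.3), SpokenWord("dog", 0.3, 0.9),
             SpokenWord("chasing", 1.0, 1.6), SpokenWord("ball", 1.8, 2.3)]
    print(pair_words_with_regions(words, fixations))
    # -> [('a', 'dog'), ('dog', 'dog'), ('chasing', 'ball'), ('ball', 'ball')]

In the paper itself, the pairing is produced by extending an existing bitext alignment framework rather than by a greedy temporal heuristic like the one above.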
