International Conference on Multimodal Interfaces

Modeling Naturalistic Affective States Via Facial, Vocal, and Bodily Expressions Recognition



Abstract

Affective and human-centered computing have attracted considerable attention in recent years, mainly due to the abundance of devices and environments able to exploit multimodal input on the part of the users and adapt their functionality to user preferences or individual habits. In the quest to receive feedback from users in an unobtrusive manner, combining facial and hand gestures with prosody information allows us to infer the user's emotional state, relying on the best-performing modality when another modality suffers from noise or poor sensing conditions. In this paper, we describe a multi-cue, dynamic approach to detecting emotion in naturalistic video sequences. In contrast to the strictly controlled recording conditions typical of audiovisual material, the proposed approach focuses on sequences taken from nearly real-world situations. Recognition is performed via a 'Simple Recurrent Network', which lends itself well to modeling dynamic events in both the user's facial expressions and speech. Moreover, this approach differs from existing work in that it models user expressivity using a dimensional representation of activation and valence, instead of detecting discrete 'universal emotions', which are scarce in everyday human-machine interaction. The algorithm is deployed on an audiovisual database recorded to simulate human-human discourse, which therefore contains less extreme expressivity and subtle variations of a number of emotion labels.
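To make the recurrent formulation concrete, below is a minimal sketch of an Elman-style Simple Recurrent Network forward pass in Python/NumPy. The feature dimensionality, the fusion of facial, gestural, and prosodic cues into a single per-frame vector, and the four activation-valence quadrant labels are illustrative assumptions, not the paper's actual configuration.

```python
import numpy as np

# Minimal Elman-style Simple Recurrent Network (forward pass only).
# Feature sizes and the 4-quadrant activation/valence labeling below
# are assumptions for illustration, not the paper's reported setup.

rng = np.random.default_rng(0)

N_IN = 12   # fused facial / hand-gesture / prosody features per frame (assumed)
N_HID = 8   # hidden units, copied back as context at each step
N_OUT = 4   # activation-valence quadrants: (+,+), (+,-), (-,+), (-,-)

W_ih = rng.normal(scale=0.1, size=(N_HID, N_IN))   # input   -> hidden
W_hh = rng.normal(scale=0.1, size=(N_HID, N_HID))  # context -> hidden
W_ho = rng.normal(scale=0.1, size=(N_OUT, N_HID))  # hidden  -> output

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def srn_forward(frames):
    """Run one sequence of per-frame feature vectors through the SRN.

    The hidden state carries context across frames, which is what lets
    the network model the temporal dynamics of expressions and speech.
    Returns the per-frame quadrant probability distributions.
    """
    h = np.zeros(N_HID)                    # context units start at zero
    outputs = []
    for x in frames:
        h = np.tanh(W_ih @ x + W_hh @ h)   # new hidden state from input + context
        outputs.append(softmax(W_ho @ h))  # quadrant probabilities for this frame
    return np.array(outputs)

# Toy usage: a 20-frame sequence of fused multimodal features.
sequence = rng.normal(size=(20, N_IN))
probs = srn_forward(sequence)
print("final-frame quadrant distribution:", np.round(probs[-1], 3))
```

The design point the sketch captures is the context loop: because each hidden state depends on the previous one, the quadrant estimate at a given frame reflects the recent history of the expression rather than a single snapshot, which is what suits the SRN to the subtle, gradually evolving expressivity of naturalistic data.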
