Journal on Multimodal User Interfaces

Multimodal emotion recognition in speech-based interaction using facial expression, body gesture and acoustic analysis


Abstract

In this paper a study on multimodal automatic emotion recognition during a speech-based interaction is presented. A database was constructed consisting of people pronouncing a sentence in a scenario where they interacted with an agent using speech. Ten people pronounced a sentence corresponding to a command while making eight different emotional expressions. Gender was equally represented, with speakers of several different native languages including French, German, Greek and Italian. Facial expression, gesture and acoustic analysis of speech were used to extract features relevant to emotion. For the automatic classification of unimodal, bimodal and multimodal data, a system based on a Bayesian classifier was used. After performing an automatic classification of each modality, the different modalities were combined using a multimodal approach. Fusion of the modalities at the feature level (before running the classifier) and at the results level (combining the outputs of the classifiers for each modality) was compared. Fusing the multimodal data resulted in a large increase in recognition rates compared to the unimodal systems: the multimodal approach increased the recognition rate by more than 10% when compared to the most successful unimodal system. Bimodal emotion recognition based on all combinations of the modalities (i.e., 'face-gesture', 'face-speech' and 'gesture-speech') was also investigated. The results show that the best pairing is 'gesture-speech'. Using all three modalities resulted in a 3.3% classification improvement over the best bimodal results.
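The abstract contrasts feature-level fusion (concatenating features before the classifier) with results-level fusion (combining the per-modality classifier outputs) around a Bayesian classifier. The sketch below is a minimal illustration of that distinction only, not the authors' implementation: it uses scikit-learn's GaussianNB on synthetic stand-in features for the face, gesture and speech modalities, and all array names, dimensions and the product-of-posteriors combination rule are assumptions.

```python
# Minimal sketch: feature-level vs. decision-level fusion with a Bayesian
# classifier (GaussianNB). Synthetic data stands in for the face, gesture
# and speech features; nothing here reproduces the paper's feature sets.
import numpy as np
from sklearn.naive_bayes import GaussianNB
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)
n_classes, per_class = 8, 10                      # 8 emotional expressions
y = np.repeat(np.arange(n_classes), per_class)    # balanced emotion labels
n_samples = y.size

# Hypothetical per-modality feature matrices (dimensions are arbitrary);
# a class-dependent shift makes the toy problem learnable.
X_face    = rng.normal(size=(n_samples, 10)) + y[:, None] * 0.5
X_gesture = rng.normal(size=(n_samples, 6))  + y[:, None] * 0.5
X_speech  = rng.normal(size=(n_samples, 12)) + y[:, None] * 0.5

idx_train, idx_test = train_test_split(np.arange(n_samples), test_size=0.25,
                                       random_state=0, stratify=y)

# --- Feature-level fusion: concatenate features, train one classifier. ---
X_all = np.hstack([X_face, X_gesture, X_speech])
clf_feat = GaussianNB().fit(X_all[idx_train], y[idx_train])
acc_feat = accuracy_score(y[idx_test], clf_feat.predict(X_all[idx_test]))

# --- Results-level fusion: one classifier per modality, combine posteriors. ---
posteriors = []
for X_mod in (X_face, X_gesture, X_speech):
    clf = GaussianNB().fit(X_mod[idx_train], y[idx_train])
    posteriors.append(clf.predict_proba(X_mod[idx_test]))
fused = np.prod(posteriors, axis=0)               # product-of-posteriors rule
acc_dec = accuracy_score(y[idx_test], fused.argmax(axis=1))

print(f"feature-level fusion accuracy:  {acc_feat:.2f}")
print(f"results-level fusion accuracy:  {acc_dec:.2f}")
```

On this toy data the two schemes behave similarly; the paper's point is that on real multimodal recordings the fused systems clearly outperform each unimodal classifier.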
