IEEE International Symposium on Circuits and Systems

A Multi-Modal Approach to Emotion Recognition using Undirected Topic Models



Abstract

A multi-modal framework for emotion recognition using bag-of-words features and undirected, replicated softmax topic models is proposed here. Topic models ignore the temporal ordering of features, allowing them to capture complex structure without a brute-force collection of statistics. Experiments are performed on face, speech, and language features extracted from the USC IEMOCAP database. Performance on facial features yields an unweighted average recall of 60.71%, a relative improvement of 8.89% over state-of-the-art approaches. Comparable performance is achieved when considering only speech (57.39%) or a fusion of speech and face information (66.05%). Individually, each source is shown to be strong at recognizing sadness (speech), happiness (face), or neutral (language) emotions, while a multi-modal fusion retains these properties and improves the accuracy to 68.92%. Implementation time for each source and their combination is provided. Results show that a turn of 1 second duration can be classified in approximately 666.65 ms, making this method highly amenable to real-time implementation.
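The abstract does not include code, and the paper's exact model configuration is not reproduced here. As a rough illustration of the kind of undirected topic model named in the title, the sketch below implements a replicated softmax RBM (Salakhutdinov and Hinton's model for bag-of-words count vectors) trained with one-step contrastive divergence in NumPy; the class name, hyperparameters, and toy data are all illustrative assumptions, not taken from the paper, and the resulting hidden-unit activations would play the role of topic features fed to a downstream emotion classifier.

```python
import numpy as np

class ReplicatedSoftmaxRBM:
    """Minimal replicated softmax RBM over bag-of-words count vectors,
    trained with one-step contrastive divergence (CD-1). Illustrative only."""

    def __init__(self, n_visible, n_hidden, lr=0.01, seed=0):
        rng = np.random.default_rng(seed)
        self.W = 0.01 * rng.standard_normal((n_visible, n_hidden))
        self.b_vis = np.zeros(n_visible)   # visible (word) biases
        self.b_hid = np.zeros(n_hidden)    # hidden (topic) biases
        self.lr = lr
        self.rng = rng

    def hidden_probs(self, v):
        # In the replicated softmax model the document length D scales the hidden bias.
        D = v.sum(axis=1, keepdims=True)
        return 1.0 / (1.0 + np.exp(-(v @ self.W + D * self.b_hid)))

    def visible_sample(self, h, D):
        # Softmax over the vocabulary, then draw D words per document.
        logits = h @ self.W.T + self.b_vis
        p = np.exp(logits - logits.max(axis=1, keepdims=True))
        p /= p.sum(axis=1, keepdims=True)
        return np.stack([self.rng.multinomial(int(d), pi)
                         for d, pi in zip(D.ravel(), p)]).astype(float)

    def cd1_step(self, v0):
        # One contrastive-divergence step: up, down, up, then update parameters.
        D = v0.sum(axis=1, keepdims=True)
        h0 = self.hidden_probs(v0)
        v1 = self.visible_sample(h0, D)
        h1 = self.hidden_probs(v1)
        n = v0.shape[0]
        self.W += self.lr * (v0.T @ h0 - v1.T @ h1) / n
        self.b_vis += self.lr * (v0 - v1).mean(axis=0)
        self.b_hid += self.lr * (h0 - h1).mean(axis=0)

# Toy usage: 20 "documents" over a 50-word vocabulary, 10 latent topics.
counts = np.random.default_rng(1).integers(0, 5, size=(20, 50)).astype(float)
rbm = ReplicatedSoftmaxRBM(n_visible=50, n_hidden=10)
for _ in range(100):
    rbm.cd1_step(counts)
features = rbm.hidden_probs(counts)  # topic posteriors usable as classifier features
```

In a multi-modal setup of the kind the abstract describes, one such model would be fit per modality's bag-of-words representation (face, speech, language), and the hidden-unit posteriors concatenated or fused before classification; the specifics of that fusion are in the paper, not this sketch.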
