IEEE International Conference on Multimedia and Expo

An End-to-End Learning Approach for Multimodal Emotion Recognition: Extracting Common and Private Information


Abstract

Multimodal emotion recognition is important for facilitating efficient interaction between humans and machines. To better detect emotional states from multimodal data, we need to effectively extract both the common information that captures dependencies among different modalities, and the private information that characterizes variations in each modality. However, most existing works pursue only one of these objectives, not both. In our work, we propose an end-to-end learning approach to simultaneously extract the common and private information for multimodal emotion recognition. Specifically, we use a correlation loss based on Hirschfeld-Gebelein-Renyi (HGR) maximal correlation and a reconstruction loss based on autoencoders to preserve the common and private information, respectively. Experimental results on the eNTERFACE'05 and RML databases demonstrate the effectiveness of our proposed approach.
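To make the training objective concrete, here is a minimal sketch of the two loss terms the abstract describes. The exact architecture and loss weighting are not given in the abstract, so all function names and the `lam` trade-off parameter are illustrative assumptions; the correlation term uses the Soft-HGR surrogate, a common differentiable approximation of HGR maximal correlation, which may differ from the paper's exact formulation.

```python
import numpy as np

def soft_hgr_correlation(f, g):
    """Soft-HGR surrogate for the HGR maximal correlation between two
    modality embeddings f, g of shape (batch, dim).
    Higher value = more common (shared) information captured."""
    f = f - f.mean(axis=0)                      # HGR assumes zero-mean features
    g = g - g.mean(axis=0)
    inner = np.mean(np.sum(f * g, axis=1))      # E[f(X)^T g(Y)]
    cov_f = f.T @ f / (len(f) - 1)
    cov_g = g.T @ g / (len(g) - 1)
    # Trace penalty replaces the hard whitening constraint of HGR
    return inner - 0.5 * np.trace(cov_f @ cov_g)

def reconstruction_loss(x, x_hat):
    """Autoencoder MSE term: preserves modality-private information."""
    return np.mean((x - x_hat) ** 2)

def total_loss(f, g, x_a, x_a_hat, x_v, x_v_hat, lam=0.1):
    """Combined objective (illustrative weighting): minimize reconstruction
    error while maximizing cross-modal correlation."""
    recon = reconstruction_loss(x_a, x_a_hat) + reconstruction_loss(x_v, x_v_hat)
    return recon - lam * soft_hgr_correlation(f, g)
```

In an end-to-end setup, `f` and `g` would be the audio and visual encoder outputs, and `x_a_hat`, `x_v_hat` the corresponding decoder reconstructions; gradients of this combined loss would flow through both encoders.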
