Emotion Recognition on large video dataset based on Convolutional Feature Extractor and Recurrent Neural Network

机译：基于卷积特征提取器和经常性神经网络的大型视频数据集的情感识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

For many years, the emotion recognition task has remained one of the most interesting and important problems in the field of human-computer interaction. In this study, we consider the emotion recognition task as a classification as well as a regression task by processing encoded emotions in different datasets using deep learning models. Our model combines a convolutional neural network (CNN) with recurrent neural network (RNN) to predict dimensional emotions on video data. In the first step, CNN extracts feature vectors from video frames. In the second step, we fed these feature vectors to train RNN for exploiting the temporal dynamics of video. Furthermore, we analyzed how each neural network contributes to the sys-tem's overall performance. The experiments are performed on publicly available datasets including the largest modern Aff-Wild2 database. It contains over sixty hours of video data. We discovered the problem of overfitting of the model on an unbalanced dataset with an illustrative example using confusion matrices. The problem is solved by downsampling technique to balance the dataset. By significantly decreasing training data, we balance the dataset, thereby, the overall performance of the model is improved. Hence, the study qualitatively describes the abilities of deep learning models exploring enough amount of data to predict facial emotions. Our proposed method is implemented using Tensorflow Keras. The code is publicly available in the repository^{1^{1https://github.com/DenisRang/Combined-CNN-RNN-for-emotion-recognition.}}

机译：多年来，情感识别任务仍然是人机互动领域最有趣和最重要的问题之一。在这项研究中，我们将情绪识别任务视为通过使用深层学习模型处理不同数据集中的编码情绪的分类以及回归任务。我们的模型将卷积神经网络（CNN）与经常性神经网络（RNN）相结合，以预测视频数据的尺寸情绪。在第一步中，CNN从视频帧中提取特征向量。在第二步中，我们馈送这些特征向量来训练RNN以利用视频的时间动态。此外，我们分析了每个神经网络如何如何促进系统的整体性能。该实验是对公共数据集进行的，包括最大的现代AFF-Wild2数据库。它包含超过六十小时的视频数据。我们发现使用混淆矩阵的说明性示例在不平衡数据集上过度地过度的问题。通过下采样技术解决问题以平衡数据集。通过显着减少训练数据，我们平衡数据集，从而提高了模型的整体性能。因此，该研究定性地描述了深度学习模型的能力，探讨了足够量数据以预测面部情绪。我们所提出的方法是使用Tensorflow Keras实现的。代码在存储库中公开使用^{1 ^{1 https://github.com/denisrang/combined-cnn-rnn-for-emotion-recognition。}}

著录项

来源
《IEEE International Conference on Image Processing, Applications and Systems》|2020年|14-20|共7页
会议地点
作者
Denis Rangulov; Muhammad Fahim;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Emotion recognition; Adaptation models; Recurrent neural networks; Databases; Predictive models; Feature extraction; Task analysis;

机译：情绪识别;适应模型;经常性神经网络;数据库;预测模型;特征提取;任务分析;

相似文献

外文文献
中文文献
专利

1. MotifCNN-fold: protein fold recognition based on fold-specific features extracted by motif-based convolutional neural networks [J] . Chen-Chen Li, Bin Liu Briefings in bioinformatics . 2020,第6期

机译：MOTIFCN-FORD：蛋白质折叠识别基于由基于主基的卷积神经网络提取的折叠特异性特征
2. Real-time video based emotion recognition using convolutional neural network and transfer learning [J] . J Sujanaa, S Palanivel Indian Journal of Science and Technology . 2020,第31期

机译：基于实时视频的情感识别使用卷积神经网络和转移学习
3. Three-dimensional feature maps and convolutional neural network-based emotion recognition [J] . Xiangwei Zheng, Xiaomei Yu, Yongqiang Yin, International Journal of Intelligent Systems . 2021,第11期

机译：三维特征地图和基于卷积神经网络的情感识别
4. End-to-end speech emotion recognition using 3-d convolutional recurrent neural networks based on modulation spectral features [C] . Zhichao Peng, Zhi Zhu, Masashi Unoki, 日本音響学会2018年春季研究発表会講演論文集 . 2018

机译：基于调制谱特征的3-d卷积递归神经网络端到端语音情感识别
5. Identifying Sports Players in Broadcast Videos Using Recurrent and Convolutional Neural Networks [D] . Chan, Alvin. 2018

机译：使用反复和卷积神经网络识别广播视频中的体育运动者
6. Gender Recognition from Human-Body Images Using Visible-Light and Thermal Camera Videos Based on a Convolutional Neural Network for Image Feature Extraction [O] . Dat Tien Nguyen, Ki Wan Kim, Hyung Gil Hong, 2017

机译：基于卷积神经网络的可见光和热成像摄像机视频对人体图像的性别识别
7. Accurate EEG-Based Emotion Recognition on Combined Features Using Deep Convolutional Neural Networks [O] . J. X. Chen, P. W. Zhang, Z. J. Mao, 2019

机译：基于精确的基于EEG的情感识别，使用深度卷积神经网络

Emotion Recognition on large video dataset based on Convolutional Feature Extractor and Recurrent Neural Network

摘要

著录项

相似文献

相关主题

期刊订阅