
Video-Audio Emotion Recognition Based on Feature Fusion Deep Learning Method


Abstract

In this paper, we propose a video-audio emotion recognition system designed to improve the classification success rate. Features from audio frames are extracted as Mel-frequency cepstral coefficients (MFCC), while features from video frames are extracted with VGG16 using weights pre-trained on the ImageNet dataset [17]. Recurrent neural networks (RNN) are then applied to each modality to process the sequence information. The outputs of both RNNs are fused in a concatenation layer, and the final classification result is obtained by a softmax layer. The proposed system achieves 90% accuracy on the RAVDESS dataset for eight emotion classes.
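The following is a minimal sketch, in Keras/TensorFlow, of the fusion architecture the abstract describes: an MFCC-sequence RNN branch, a VGG16-plus-RNN video branch, concatenation of the two RNN outputs, and a softmax classifier over eight classes. The input shapes, LSTM cells, layer sizes, and optimizer are illustrative assumptions, not the authors' exact configuration.

```python
# Hedged sketch of the video-audio feature-fusion model from the abstract.
# Shapes and hyperparameters below are assumptions for illustration only.
import tensorflow as tf
from tensorflow.keras import layers, Model
from tensorflow.keras.applications import VGG16

NUM_CLASSES = 8                    # eight emotion classes in RAVDESS
AUDIO_STEPS, N_MFCC = 100, 40      # assumed: 100 audio frames x 40 MFCCs
VIDEO_STEPS, H, W = 30, 224, 224   # assumed: 30 video frames at VGG16 input size

# Audio branch: MFCC sequence -> RNN
audio_in = layers.Input(shape=(AUDIO_STEPS, N_MFCC), name="mfcc_sequence")
audio_feat = layers.LSTM(128)(audio_in)

# Video branch: per-frame VGG16 features (ImageNet weights, frozen) -> RNN
vgg = VGG16(weights="imagenet", include_top=False, pooling="avg")
vgg.trainable = False
video_in = layers.Input(shape=(VIDEO_STEPS, H, W, 3), name="video_frames")
frame_feat = layers.TimeDistributed(vgg)(video_in)   # (batch, steps, 512)
video_feat = layers.LSTM(128)(frame_feat)

# Feature fusion: concatenate both RNN outputs, then softmax classification
fused = layers.Concatenate()([audio_feat, video_feat])
out = layers.Dense(NUM_CLASSES, activation="softmax")(fused)

model = Model(inputs=[audio_in, video_in], outputs=out)
model.compile(optimizer="adam",
              loss="categorical_crossentropy",
              metrics=["accuracy"])
model.summary()
```

Freezing the VGG16 backbone and wrapping it in TimeDistributed keeps the per-frame feature extractor fixed while only the two LSTMs and the classifier are trained; whether the original system fine-tunes VGG16 is not stated in the abstract.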
