Audio-Visual Emotion Recognition System for Variable Length Spatio-Temporal Samples Using Deep Transfer-Learning

机译：深度传递学习的可变长度时空样本视听情感识别系统

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic Emotion recognition is renowned for being a difficult task, even for human intelligence. Due to the importance of having enough data in classification problems, we introduce a framework developed with the purpose of generating labeled audio to create our own database. In this paper we present a new model for audio-video emotion recognition using Transfer Learning (TL). The idea is to combine a pre-trained high level feature extractor Convolutional Neural Network (CNN) and a Bidirectional Recurrent Neural Network (BRNN) model to address the issue of variable sequence length inputs. Throughout the design process we discuss the main problems related to the high complexity of the task due to its inherent subjective nature and, on the other hand, the important results obtained by testing the model on different databases, outperforming the state-of-the-art algorithms in the SAVEE [3] database. Furthermore, we use the mentioned application to perform precision classification (per user) into low resources real scenarios with promising results.

机译：自动情感识别以一项艰巨的任务而闻名，即使对于人类智能也是如此。由于在分类问题中拥有足够的数据非常重要，因此我们引入了一个框架，该框架旨在生成标记的音频来创建我们自己的数据库。在本文中，我们提出了一种使用转移学习（TL）的音视频情感识别的新模型。这个想法是将预训练的高级特征提取器卷积神经网络（CNN）和双向递归神经网络（BRNN）模型相结合，以解决可变序列长度输入的问题。在整个设计过程中，由于其固有的主观性，我们讨论了与任务的高复杂性有关的主要问题，另一方面，我们讨论了通过在不同数据库上测试模型而获得的重要结果，其结果优于现状SAVEE [3]数据库中的艺术算法。此外，我们使用提到的应用程序对每个资源进行精确分类（每个用户），以实现有希望的结果的低资源实际情况。

著录项

来源
《International Conference on Business Information Systems》|2020年|434-446|共13页
会议地点
作者
Antonio Cano Montes; Luis A. Hernandez Gomez;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Emotion recognition; Multimodal deep learning; Transfer learning; Variable sequence length; Model fusion; Convolutional neural network; Bidirectional recurrent neural network;

机译：情绪识别;多模式深度学习;转移学习;可变序列长度;模型融合;卷积神经网络双向递归神经网络;

相似文献

外文文献
中文文献
专利

1. An Audio-Visual Emotion Recognition System Using Deep Learning Fusion for a Cognitive Wireless Framework [J] . M. Shamim Hossain, Ghulam Muhammad IEEE Wireless Communications . 2019,第3期

机译：基于深度学习融合的认知无线框架视听情感识别系统
2. Leveraging recent advances in deep learning for audio-Visual emotion recognition [J] . Schoneveld Liam, Othmani Alice, Abdelkawy Hazem Pattern recognition letters . 2021,第Juna期

机译：利用最近的视听情感认可深度学习的进步
3. Emotion recognition using deep learning approach from audio-visual emotional big data [J] . Hossain M. Shamim, Muhammad Ghulam Information Fusion . 2019,第期

机译：从视听情绪大数据使用深度学习方法的情感认可
4. Deep Learning Based Video Spatio-Temporal Modeling for Emotion Recognition [C] . Ruben D. Fonnegra, Gloria M. Diaz International conference on human-computer interaction . 2018

机译：基于深度学习的情绪识别视频时空建模
5. Deep Architectures for Spatio-Temporal Sequence Recognition with Applications in Automatic Seizure Detection [D] . Golmohammadi, Meysam. 2021

机译：用于自动癫痫发作检测中的应用的太平架构
6. Deep-Net: A Lightweight CNN-Based Speech Emotion Recognition System Using Deep Frequency Features [O] . Tursunov Anvarjon, Mustaqeem, Soonil Kwon 2020

机译：深网络：使用深频特征的基于轻量级CNN的语音情感识别系统
7. Leveraging recent advances in deep learning for audio-Visual emotion recognition [O] . Liam Schoneveld, Alice Othmani, Hazem Abdelkawy 2021

机译：利用最近的视听情感认可深度学习的进步

Audio-Visual Emotion Recognition System for Variable Length Spatio-Temporal Samples Using Deep Transfer-Learning

摘要

著录项

相似文献

相关主题

期刊订阅