首页> 外文会议>International Conference on Business Information Systems >Audio-Visual Emotion Recognition System for Variable Length Spatio-Temporal Samples Using Deep Transfer-Learning
【24h】

Audio-Visual Emotion Recognition System for Variable Length Spatio-Temporal Samples Using Deep Transfer-Learning

机译:深度传递学习的可变长度时空样本视听情感识别系统

获取原文

摘要

Automatic Emotion recognition is renowned for being a difficult task, even for human intelligence. Due to the importance of having enough data in classification problems, we introduce a framework developed with the purpose of generating labeled audio to create our own database. In this paper we present a new model for audio-video emotion recognition using Transfer Learning (TL). The idea is to combine a pre-trained high level feature extractor Convolutional Neural Network (CNN) and a Bidirectional Recurrent Neural Network (BRNN) model to address the issue of variable sequence length inputs. Throughout the design process we discuss the main problems related to the high complexity of the task due to its inherent subjective nature and, on the other hand, the important results obtained by testing the model on different databases, outperforming the state-of-the-art algorithms in the SAVEE [3] database. Furthermore, we use the mentioned application to perform precision classification (per user) into low resources real scenarios with promising results.
机译:自动情感识别以一项艰巨的任务而闻名,即使对于人类智能也是如此。由于在分类问题中拥有足够的数据非常重要,因此我们引入了一个框架,该框架旨在生成标记的音频来创建我们自己的数据库。在本文中,我们提出了一种使用转移学习(TL)的音视频情感识别的新模型。这个想法是将预训练的高级特征提取器卷积神经网络(CNN)和双向递归神经网络(BRNN)模型相结合,以解决可变序列长度输入的问题。在整个设计过程中,由于其固有的主观性,我们讨论了与任务的高复杂性有关的主要问题,另一方面,我们讨论了通过在不同数据库上测试模型而获得的重要结果,其结果优于现状SAVEE [3]数据库中的艺术算法。此外,我们使用提到的应用程序对每个资源进行精确分类(每个用户),以实现有希望的结果的低资源实际情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号