首页> 外文期刊>Information Fusion >Emotion recognition using deep learning approach from audio-visual emotional big data
【24h】

Emotion recognition using deep learning approach from audio-visual emotional big data

机译:从视听情绪大数据使用深度学习方法的情感认可

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes an emotion recognition system using a deep learning approach from emotional Big Data. The Big Data comprises of speech and video. In the proposed system, a speech signal is first processed in the frequency domain to obtain a Mel-spectrogram, which can be treated as an image. Then this Mel-spectrogram is fed to a convolutional neural network (CNN). For video signals, some representative frames from a video segment are extracted and fed to the CNN. The outputs of the two CNNs are fused using two consecutive extreme learning machines (ELMs). The output of the fusion is given to a support vector machine (SVM) for final classification of the emotions. The proposed system is evaluated using two audio-visual emotional databases, one of which is Big Data. Experimental results confirm the effectiveness of the proposed system involving the CNNs and the ELMs.
机译:本文提出了一种从情绪大数据的深度学习方法的情感识别系统。 大数据包括语音和视频。 在所提出的系统中,首先在频域中处理语音信号以获得熔点,其可以被视为图像。 然后将该熔点谱图送入卷积神经网络(CNN)。 对于视频信号,从视频段中提取并馈送到CNN的一些代表帧。 两个CNN的输出使用两个连续的极限学习机(ELM)融合。 融合的输出被给予支持向量机(SVM),用于最终的情绪分类。 使用两个视听情绪数据库来评估所提出的系统,其中一个是大数据。 实验结果证实了涉及CNN和ELM的提出系统的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号