首页> 外文会议>International Conference on Platform Technology and Service >Speech Emotion Recognition from Spectrograms with Deep Convolutional Neural Network
【24h】

Speech Emotion Recognition from Spectrograms with Deep Convolutional Neural Network

机译:深度卷积神经网络的频谱图语音情感识别

获取原文

摘要

This paper presents a method for speech emotion recognition using spectrograms and deep convolutional neural network (CNN). Spectrograms generated from the speech signals are input to the deep CNN. The proposed model consisting of three convolutional layers and three fully connected layers extract discriminative features from spectrogram images and outputs predictions for the seven emotions. In this study, we trained the proposed model on spectrograms obtained from Berlin emotions dataset. Furthermore, we also investigated the effectiveness of transfer learning for emotions recognition using a pre-trained AlexNet model. Preliminary results indicate that the proposed approach based on freshly trained model is better than the fine-tuned model, and is capable of predicting emotions accurately and efficiently.
机译:本文提出了一种使用频谱图和深度卷积神经网络(CNN)进行语音情感识别的方法。从语音信号生成的频谱图被输入到深度CNN。所提出的模型由三个卷积层和三个完全连接的层组成,从频谱图图像中提取判别特征,并输出对七种情绪的预测。在这项研究中,我们在从柏林情绪数据集获得的频谱图上训练了提出的模型。此外,我们还使用预先训练的AlexNet模型研究了转移学习对情绪识别的有效性。初步结果表明,所提出的基于新近训练模型的方法优于微调模型,并且能够准确有效地预测情绪。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号