首页> 外文会议>Asia-Pacific Signal and Information Processing Association Annual Summit and Conference >Speech emotion recognition using convolutional long short-term memory neural network and support vector machines
【24h】

Speech emotion recognition using convolutional long short-term memory neural network and support vector machines

机译:使用卷积长短期记忆神经网络和支持向量机的语音情感识别

获取原文
获取外文期刊封面目录资料

摘要

In this paper, we propose a speech emotion recognition technique using convolutional long short-term memory (LSTM) recurrent neural network (ConvLSTM-RNN) as a phoneme-based feature extractor from raw input speech signal. In the proposed technique, ConvLSTM-RNN outputs phoneme- based emotion probabilities to every frame of an input utterance. Then these probabilities are converted into statistical features of the input utterance and used for the input features of support vector machines (SVMs) or linear discriminant analysis (LDA) system to classify the utterance-level emotions. To assess the effectiveness of the proposed technique, we conducted experiments in the classification of four emotions (anger, happiness, sadness, and neutral) on IEMOCAP database. The result showed that the proposed technique with either of SVM or LDA classifier outperforms the conventional ConvLSTM-based one.
机译:在本文中,我们提出了一种使用卷积长短期记忆(LSTM)递归神经网络(ConvLSTM-RNN)作为从原始输入语音信号中基于音素的特征提取器的语音情感识别技术。在提出的技术中,ConvLSTM-RNN将基于音素的情绪概率输出到输入话语的每一帧。然后将这些概率转换为输入话语的统计特征,并用于支持向量机(SVM)或线性判别分析(LDA)系统的输入特征,以对话语级别的情绪进行分类。为了评估所提出技术的有效性,我们在IEMOCAP数据库中进行了四种情绪(愤怒,幸福,悲伤和中立)分类的实验。结果表明,采用SVM或LDA分类器的拟议技术优于传统的基于ConvLSTM的技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号