首页> 外文会议>IEEE International Conference on Multimedia and Expo >Deep convolutional recurrent neural network with attention mechanism for robust speech emotion recognition
【24h】

Deep convolutional recurrent neural network with attention mechanism for robust speech emotion recognition

机译:具有注意机制的深度卷积递归神经网络用于鲁棒语音情感识别

获取原文

摘要

We present a deep convolutional recurrent neural network for speech emotion recognition based on the log-Mel filterbank energies, where the convolutional layers are responsible for the discriminative feature learning. Based on the hypothesis that a better understanding of the internal configuration within an utterance would help reduce misclassification, we further propose a convolutional attention mechanism to learn the utterance structure relevant to the task. In addition, we quantitatively measure the performance gain contributed by each module in our model in order to characterize the nature of emotion expressed in speech. The experimental results on the eNTERFACE'05 emotion database validate our hypothesis and also demonstrate an absolute improvement by 4.62% compared to the state-of-the-art approach.
机译:我们提出基于对数-梅尔滤波器组能量的深度卷积递归神经网络,用于语音情感识别,其中卷积层负责判别特征学习。基于这样的假设,即更好地理解发声内的内部配置将有助于减少错误分类,我们进一步提出了卷积注意机制来学习与任务相关的发声结构。另外,我们定量测量模型中每个模块所贡献的性能,以表征语音表达的情绪的本质。在eNTERFACE'05情感数据库上的实验结果验证了我们的假设,并且与最先进的方法相比,绝对值提高了4.62%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号