首页> 外文会议>IEEE International Symposium on Signal Processing and Information Technology >Speech emotion recognition on mobile devices based on modulation spectral feature pooling and deep neural networks
【24h】

Speech emotion recognition on mobile devices based on modulation spectral feature pooling and deep neural networks

机译:基于调制频谱特征池和深度神经网络的移动设备语音情感识别

获取原文

摘要

In this study, the problem of speech emotion recognition (SER) in-the-wild is addressed. A new modulation spectral feature pooling scheme is proposed to mitigate the detrimental effects of background noise. On top of these features, two DNN-based architectures are tested for the prediction of arousal and valence emotional primitives: a multi-layer perceptron (MLP) and a recurrent neural network based on Long-Short Term Memory (LSTM). Experiments are conducted using the RECOLA dataset of spontaneous interactions. In order to simulate data collected in-the-wild, the clean speech files were corrupted with different levels of background noise and room impulse responses collected using a mobile device. Both stationary and non-stationary noise types (fan and babble) were considered in our experiments. Three distinct scenarios were explored: noise only, reverberation only and noise-plus-reverberation. Experimental results have shown that, in most of the scenarios, the proposed SER system achieved better performance in terms of concordance correlation coefficients (CCC) compared to the benchmark algorithm described in the 2016 Audio/Visual Emotion Challenge. The proposed feature system also showed to be more robust when noise-plus-reverberation is considered.
机译:在这项研究中,解决了野外的语音情感识别(SER)问题。提出了一种新的调制频谱特征池化方案,以减轻背景噪声的不利影响。除了这些功能之外,还测试了两种基于DNN的架构来预测唤醒和化合情绪原语:多层感知器(MLP)和基于长短期记忆(LSTM)的递归神经网络。使用RECOLA自发相互作用的数据集进行实验。为了模拟在野外收集的数据,使用不同级别的背景噪声和使用移动设备收集的房间冲动响应来破坏干净的语音文件。在我们的实验中考虑了固定和非固定噪声类型(风扇和ba声)。探索了三种不同的情况:仅噪声,仅混响和噪声加混响。实验结果表明,在大多数情况下,与2016年影音情感挑战赛中描述的基准算法相比,所提出的SER系统在一致性相关系数(CCC)方面取得了更好的性能。当考虑噪声加混响时,建议的特征系统也显示出更强大的功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号