首页> 外文会议>International Conference on Network Infrastructure and Digital Content >Feature Fusion of Speech Emotion Recognition Based on Deep Learning
【24h】

Feature Fusion of Speech Emotion Recognition Based on Deep Learning

机译:基于深度学习的语音情感识别特征融合

获取原文

摘要

Speech emotion recognition (SER) is a hot topic in academia. One of the key issues in improving the performance of SER systems is the choice of speech emotion features. In order to establish a robust speech emotion recognition system, it is essential to select the features which can be a perfect representation of speech emotion attributes. Researchers has done a lot of work, proposed a variety of emotional features and made great progress. Although each kind of features were proven to be effective, most of methods are based on a single type. In this paper, we proposed a method of feature fusion based on deep learning, combining spectral-based features and pitch-based hyper-prosodic features. The experiments show that this method improves the performance of speech emotion recognition system.
机译:语音情感识别(SER)是学术界的热门话题。改善SER系统性能的关键问题之一是语音情感特征的选择。为了建立鲁棒的语音情感识别系统,必须选择可以完美表达语音情感属性的特征。研究人员做了很多工作,提出了多种情感特征并取得了长足的进步。尽管每种功能都被证明是有效的,但是大多数方法都基于一种类型。在本文中,我们提出了一种基于深度学习的特征融合方法,将基于频谱的特征和基于音高的超韵律特征相结合。实验表明,该方法提高了语音情感识别系统的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号