首页> 外文期刊>Software >Deep learning and SVM-based emotion recognition from Chinese speech for smart affective services
【24h】

Deep learning and SVM-based emotion recognition from Chinese speech for smart affective services

机译:深度学习和基于SVM的中文语音情感识别技术可提供智能情感服务

获取原文
获取原文并翻译 | 示例
       

摘要

Emotion recognition is challenging for understanding people and enhances human-computer interaction experiences, which contributes to the harmonious running of smart health care and other smart services. In this paper, several kinds of speech features such as Mel frequency cepstrum coefficient, pitch, and formant were extracted and combined in different ways to reflect the relationship between feature fusions and emotion recognition performance. In addition, we explored two methods, namely, support vector machine (SVM) and deep belief networks (DBNs), to classify six emotion status: anger, fear, joy, neutral status, sadness, and surprise. In the SVM-based method, we used SVM multi-classification algorithm to optimize the parameters of penalty factor and kernel function. With DBN, we adjusted different parameters to achieve the best performance when solving different emotions. Both gender-dependent and gender-independent experiments were conducted on the Chinese Academy of Sciences emotional speech database. The mean accuracy of SVM is 84.54%, and the mean accuracy of DBN is 94.6%. The experiments show that the DBN-based approach has good potential for practical usage, and suitable feature fusions will further improve the performance of speech emotion recognition. Copyright (c) 2017 John Wiley & Sons, Ltd.
机译:情感识别对于理解人们具有挑战性,并增强了人机交互体验,这有助于智能医疗保健和其他智能服务的和谐运行。本文提取了几种语音特征,如梅尔频率倒谱系数,音调和共振峰,并以不同的方式进行了组合,以反映特征融合与情感识别性能之间的关系。此外,我们探索了两种方法,即支持向量机(SVM)和深度信念网络(DBN),对六种情绪状态进行了分类:愤怒,恐惧,喜悦,中立状态,悲伤和惊奇。在基于支持向量机的方法中,我们使用了支持向量机的多分类算法来优化惩罚因子和核函数的参数。使用DBN,我们调整了不同的参数以在解决不同的情绪时获得最佳性能。在中国科学院情感语音数据库上进行了性别相关和性别无关的实验。 SVM的平均准确度为84.54%,DBN的平均准确度为94.6%。实验表明,基于DBN的方法具有良好的实际应用潜力,适当的特征融合将进一步提高语音情感识别的性能。版权所有(c)2017 John Wiley&Sons,Ltd.

著录项

  • 来源
    《Software》 |2017年第8期|1127-1138|共12页
  • 作者单位

    China Univ Petr, Sch Comp & Commun Engn, 66 Changjiang West Rd, Qingdao 266580, Peoples R China;

    China Univ Petr, Sch Comp & Commun Engn, 66 Changjiang West Rd, Qingdao 266580, Peoples R China;

    Sci & Technol Opt Radiat Lab, Beijing 100854, Peoples R China;

    St Francis Xavier Univ, Dept Comp Sci, Antigonish, NS, Canada;

    China Univ Petr, Sch Comp & Commun Engn, 66 Changjiang West Rd, Qingdao 266580, Peoples R China;

    China Univ Petr, Sch Comp & Commun Engn, 66 Changjiang West Rd, Qingdao 266580, Peoples R China;

    Fudan Univ, Coll Comp Sci & Technol, Shanghai 200433, Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    speech emotion recognition; feature fusion; support vector machine; deep belief network;

    机译:语音情感识别;特征融合;支持向量机;深度信念网络;
  • 入库时间 2022-08-18 02:50:37

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号