Deep learning and SVM-based emotion recognition from Chinese speech for smart affective services

Zhang Weishan; Zhao Dehai; Chai Zhi; Yang Laurence T.; Liu Xin; Gong Faming; Yang Su

首页> 外文期刊>Software >Deep learning and SVM-based emotion recognition from Chinese speech for smart affective services

【24h】

Deep learning and SVM-based emotion recognition from Chinese speech for smart affective services

机译：深度学习和基于SVM的中文语音情感识别技术可提供智能情感服务

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Emotion recognition is challenging for understanding people and enhances human-computer interaction experiences, which contributes to the harmonious running of smart health care and other smart services. In this paper, several kinds of speech features such as Mel frequency cepstrum coefficient, pitch, and formant were extracted and combined in different ways to reflect the relationship between feature fusions and emotion recognition performance. In addition, we explored two methods, namely, support vector machine (SVM) and deep belief networks (DBNs), to classify six emotion status: anger, fear, joy, neutral status, sadness, and surprise. In the SVM-based method, we used SVM multi-classification algorithm to optimize the parameters of penalty factor and kernel function. With DBN, we adjusted different parameters to achieve the best performance when solving different emotions. Both gender-dependent and gender-independent experiments were conducted on the Chinese Academy of Sciences emotional speech database. The mean accuracy of SVM is 84.54%, and the mean accuracy of DBN is 94.6%. The experiments show that the DBN-based approach has good potential for practical usage, and suitable feature fusions will further improve the performance of speech emotion recognition. Copyright (c) 2017 John Wiley & Sons, Ltd.

机译：情感识别对于理解人们具有挑战性，并增强了人机交互体验，这有助于智能医疗保健和其他智能服务的和谐运行。本文提取了几种语音特征，如梅尔频率倒谱系数，音调和共振峰，并以不同的方式进行了组合，以反映特征融合与情感识别性能之间的关系。此外，我们探索了两种方法，即支持向量机（SVM）和深度信念网络（DBN），对六种情绪状态进行了分类：愤怒，恐惧，喜悦，中立状态，悲伤和惊奇。在基于支持向量机的方法中，我们使用了支持向量机的多分类算法来优化惩罚因子和核函数的参数。使用DBN，我们调整了不同的参数以在解决不同的情绪时获得最佳性能。在中国科学院情感语音数据库上进行了性别相关和性别无关的实验。 SVM的平均准确度为84.54％，DBN的平均准确度为94.6％。实验表明，基于DBN的方法具有良好的实际应用潜力，适当的特征融合将进一步提高语音情感识别的性能。版权所有（c）2017 John Wiley＆Sons，Ltd.

著录项

来源
《Software》 |2017年第8期|1127-1138|共12页
作者
Zhang Weishan; Zhao Dehai; Chai Zhi; Yang Laurence T.; Liu Xin; Gong Faming; Yang Su;
展开▼
作者单位

China Univ Petr, Sch Comp & Commun Engn, 66 Changjiang West Rd, Qingdao 266580, Peoples R China;

China Univ Petr, Sch Comp & Commun Engn, 66 Changjiang West Rd, Qingdao 266580, Peoples R China;

Sci & Technol Opt Radiat Lab, Beijing 100854, Peoples R China;

St Francis Xavier Univ, Dept Comp Sci, Antigonish, NS, Canada;

China Univ Petr, Sch Comp & Commun Engn, 66 Changjiang West Rd, Qingdao 266580, Peoples R China;

China Univ Petr, Sch Comp & Commun Engn, 66 Changjiang West Rd, Qingdao 266580, Peoples R China;

Fudan Univ, Coll Comp Sci & Technol, Shanghai 200433, Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
speech emotion recognition; feature fusion; support vector machine; deep belief network;

机译：语音情感识别;特征融合;支持向量机;深度信念网络;
入库时间 2022-08-18 02:50:37

相似文献

外文文献
中文文献
专利

1. Deep features-based speech emotion recognition for smart affective services [J] . Badshah Abdul Malik, Rahim Nasir, Ullah Noor, Multimedia Tools and Applications . 2019,第5期

机译：基于深度特征的语音情感识别，用于智能情感服务
2. Emotion Recognition from Chinese Speech for Smart Affective Services Using a Combination of SVM and DBN [J] . Lianzhang Zhu, Leiming Chen, Dehai Zhao, Sensors . 2017,第7期

机译：SVM与DBN结合使用中文语音进行智能情感服务的情感识别
3. Emotion Recognition from Chinese Speech for Smart Affective Services Using a Combination of SVM and DBN [J] . Lianzhang Zhu, Leiming Chen, Dehai Zhao, Sensors . 2017,第7期

机译：SVM与DBN结合使用中文语音进行智能情感服务的情感识别
4. An Affective Service based on Multi-Modal Emotion Recognition, using EEG enabled Emotion Tracking and Speech Emotion Recognition [C] . Danai Styliani Moschona IEEE International Conference on Consumer Electronics - Asia . 2020

机译：一种基于多模态情绪识别的情感服务，使用EEG使能情感跟踪和语音情感认可
5. Multimodal Sensing and Data Processing for Speaker and Emotion Recognition Using Deep Learning Models with Audio, Video and Biomedical Sensors [D] . Abtahi, Farnaz. 2018

机译：使用具有音频，视频和生物医学传感器的深度学习模型，对说话人和情感识别进行多模式传感和数据处理
6. Emotion Recognition from Chinese Speech for Smart Affective Services Using a Combination of SVM and DBN [O] . Lianzhang Zhu, Leiming Chen, Dehai Zhao, 2017

机译：SVM与DBN结合使用中文语音进行智能情感服务的情感识别
7. Towards real-time speech emotion recognition for affective e-learning [O] . 2016

机译：走向实时语音情感识别以进行情感电子学习

Deep learning and SVM-based emotion recognition from Chinese speech for smart affective services

摘要

著录项

相似文献

相关主题

期刊订阅