Recognize basic emotional states in speech by machine learning techniques using mel-frequency cepstral coefficient features

Yang Ningning; Dey Nilanjan; Sherratt R. Simon; Shi Fuqian

Abstract

Speech Emotion Recognition (SER) has been widely used in many fields, such as the smart home assistants commonly found on the market. A smart home assistant that could detect the user's emotion would improve communication between the user and the assistant, enabling the assistant to offer more productive feedback. Thus, the aim of this work is to analyze emotional states in speech and propose a suitable algorithm, considering performance versus complexity, for deployment in smart home devices. Four emotional speech sets were selected from the Berlin Emotional Database (EMO-DB) as experimental data, and 26 MFCC features were extracted from each type of emotional speech to identify the emotions of happiness, anger, sadness and neutrality. Then, speaker-independent SER experiments were conducted using the Back Propagation Neural Network (BPNN), Extreme Learning Machine (ELM), Probabilistic Neural Network (PNN) and Support Vector Machine (SVM). Weighing recognition accuracy against processing time, this work shows that SVM performed best among the four methods, making it a good candidate for SER deployment in smart home devices. SVM achieved an overall accuracy of 92.4% while offering low computational requirements when training and testing. We conclude that the MFCC features and the SVM classification models used in speaker-independent experiments are highly effective in the automatic prediction of emotion.
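
The abstract does not say how the speaker-independent experiments were partitioned. As a minimal sketch of what such an evaluation could look like, the Python below uses a leave-one-speaker-out split, which is an assumption here and not necessarily the authors' protocol. The speaker IDs, the toy utterance list, and the function names are all hypothetical; a real run would attach a 26-dimensional MFCC vector to each utterance and train a classifier such as an SVM on each fold.

```python
from collections import defaultdict

# Hypothetical toy corpus: (speaker_id, emotion_label) pairs.
# Real inputs would also carry a 26-dimensional MFCC vector per utterance.
UTTERANCES = [
    ("spk_a", "happiness"), ("spk_a", "anger"),
    ("spk_b", "sadness"), ("spk_b", "neutrality"),
    ("spk_c", "anger"), ("spk_c", "sadness"),
]


def leave_one_speaker_out(utterances):
    """Yield (held_out_speaker, train, test) so no speaker is in both sets."""
    by_speaker = defaultdict(list)
    for speaker, label in utterances:
        by_speaker[speaker].append((speaker, label))
    for held_out in sorted(by_speaker):
        test = by_speaker[held_out]
        train = [
            utterance
            for speaker in sorted(by_speaker)
            if speaker != held_out
            for utterance in by_speaker[speaker]
        ]
        yield held_out, train, test


def overall_accuracy(true_labels, predicted_labels):
    """Fraction of utterances whose predicted label matches the true label."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("cannot score an empty label list")
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


if __name__ == "__main__":
    for held_out, train, test in leave_one_speaker_out(UTTERANCES):
        train_speakers = {speaker for speaker, _ in train}
        # The held-out speaker must never appear in the training set.
        print(held_out, len(train), len(test), held_out not in train_speakers)
```

Pooling the predictions from every fold and passing them to overall_accuracy gives a single figure comparable in kind to the overall accuracy the abstract reports, although the sketch itself produces no such result.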

Bibliographic record

Source
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology | 2020, Issue 1 | 12 pages
Authors
Yang Ningning; Dey Nilanjan; Sherratt R. Simon; Shi Fuqian
Author affiliations

Wenzhou Med Univ Affiliated Hosp 1 Wenzhou 325035 Peoples R China;

Techno India Coll Technol Dept Informat Technol Kolkata W Bengal India;

Univ Reading Dept Biomed Engn Reading Berks England;

Wenzhou Med Univ Affiliated Hosp 1 Wenzhou 325035 Peoples R China;

Original format: PDF
Language: English
Chinese Library Classification: Automation systems
Keywords
Emotion recognition; back propagation neural network; extreme learning machine; Mel-frequency cepstral coefficients; smart home; support vector machine;
