...
首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >Recognize basic emotional statesin speech by machine learning techniques using mel-frequency cepstral coefficient features
【24h】

Recognize basic emotional statesin speech by machine learning techniques using mel-frequency cepstral coefficient features

机译:通过使用MEL-频率谱系统的机器学习技术识别基本的情绪状态语音讲话

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Speech Emotion Recognition (SER) has been widely used in many fields, such as smart home assistants commonly found in the market. Smart home assistants that could detect the user's emotion would improve the communication between a user and the assistant enabling the assistant to offer more productive feedback. Thus, the aim of this work is to analyze emotional states in speech and propose a suitable algorithm considering performance verses complexity for deployment in smart home devices. The four emotional speech sets were selected from the Berlin Emotional Database (EMO-DB) as experimental data, 26 MFCC features were extracted from each type of emotional speech to identify the emotions of happiness, anger, sadness and neutrality. Then, speaker-independent experiments for our Speech emotion Recognition (SER) were conducted by using the Back Propagation Neural Network (BPNN), Extreme Learning Machine (ELM), Probabilistic Neural Network (PNN) and Support Vector Machine (SVM). Synthesizing the recognition accuracy and processing time, this work shows that the performance of SVM was the best among the four methods as a good candidate to be deployed for SER in smart home devices. SVM achieved an overall accuracy of 92.4% while offering low computational requirements when training and testing. We conclude that the MFCC features and the SVM classification models used in speaker-independent experiments are highly effective in the automatic prediction of emotion.
机译:语音情感认可(SER)已广泛应用于许多领域,例如市场上常见的智能家庭助理。可以检测用户的情感的智能家庭助理将改善用户和助手之间的沟通,使助手能够提供更高效的反馈。因此,这项工作的目的是分析言论中的情绪状态,提出了一种适当的算法,考虑在智能家居设备中进行部署的性能复杂性。从柏林情绪数据库(EMO-DB)中选择了四种情绪语音集合,作为实验数据,从每种情绪演讲中提取26个MFCC功能,以确定幸福,愤怒,悲伤和中立的情绪。然后,通过使用后传播神经网络(BPNN),极端学习机(ELM),概率神经网络(PNN)和支持向量机(SVM)来进行语音情感识别(SER)的扬声器的独立实验。综合识别准确性和处理时间,这项工作表明,SVM的性能是四种方法中最好的,作为智能家居设备中的SER部署的良好候选者。 SVM实现了92.4%的整体准确性,同时在培训和测试时提供低计算要求。我们得出结论,MFCC特征和在扬声器的实验中使用的SVM分类模型在自动预测情绪中非常有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号