...
首页> 外文期刊>International journal of speech technology >Segment based emotion recognition using combined reduced features
【24h】

Segment based emotion recognition using combined reduced features

机译:使用组合的简化功能进行基于段的情感识别

获取原文
获取原文并翻译 | 示例
           

摘要

The attitude of a human being involves with their emotions. Emotions can be observed in either verbally or visually or both. Verbal emotion recognition is a difficult task and an area of speech processing. It has a wide variety of applications in almost all fields. In this work, the authors have tried to recognize five types of emotion as anger, sadness, happiness, fear, and neutral. The work is focussed on the choice of spectral feature computation. For such purpose, Mel-frequency Cepstral coefficients (MFCC), spectral roll-off, spectral centroid and spectral flux are considered on frame-level extraction. Some of these features need to be reduced, combined, and balanced. The combined methods are verified and observed the effectiveness of results. The resulting features are used with neural network (NN) based models for recognition purpose. The models of multilayer perceptron (MLP), radial basis function network (RBFN), probabilistic neural network (PNN) and deep neural network (DNN) are considered and tested for the chosen features. It is observed that less amount of features provides reliable accuracy in case of PNN. The same utilizes less time for training and testing in case of MLP, RBFN, and PNN. However, DNN is not suitable for fewer amounts of features. It requires large data for better accuracy in the particular field. The results support the PNN with an average accuracy of 96.9% with low-dimensional feature sets, whereas the average accuracy of MLP, RBFN, DNN models found 90.1 %, 92.7%, and 73.6% respectively.
机译:人的态度与他们的情感有关。可以用言语或视觉或两者兼有的方式观察情绪。言语情感识别是一项艰巨的任务,也是语音处理领域。它在几乎所有领域中都有广泛的应用。在这项工作中,作者试图识别出五种类型的情绪:愤怒,悲伤,幸福,恐惧和中立。这项工作集中在光谱特征计算的选择上。为此,在帧级提取中考虑梅尔频率倒谱系数(MFCC),频谱滚降,频谱质心和频谱通量。其中一些功能需要减少,组合和平衡。验证了组合方法并观察了结果的有效性。所得特征与基于神经网络(NN)的模型一起用于识别目的。考虑了多层感知器(MLP),径向基函数网络(RBFN),概率神经网络(PNN)和深度神经网络(DNN)的模型,并针对所选功能进行了测试。可以看出,在PNN的情况下,较少的特征量提供了可靠的准确性。在MLP,RBFN和PNN的情况下,相同的方法会花费较少的时间进行培训和测试。但是,DNN不适合较少数量的功能。它需要大数据才能在特定领域获得更高的准确性。结果支持低维特征集的PNN的平均准确度为96.9%,而MLP,RBFN和DNN模型的平均准确度分别为90.1%,92.7%和73.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号