首页> 外文期刊>International journal of speech technology >Modelling speech emotion recognition using logistic regression and decision trees
【24h】

Modelling speech emotion recognition using logistic regression and decision trees

机译:使用逻辑回归和决策树建模语音情感识别

获取原文
获取原文并翻译 | 示例
           

摘要

Speech emotion recognition has been one of the interesting issues in speech processing over the last few decades. Modelling of the emotion recognition process serves to understand as well as assess the performance of the system. This paper compares two different models for speech emotion recognition using vocal tract features namely, the first four formants and their respective bandwidths. The first model is based on a decision tree and the second one employs logistic regression. Whereas the decision tree models are based on machine learning, regression models have a strong statistical basis. The logistic regression models and the decision tree models developed in this work for several cases of binary classifications were validated by speech emotion recognition experiments conducted on a Malayalam emotional speech database of 2800 speech files, collected from ten speakers. The models are not only simple, but also meaningful since they indicate the contribution of each predictor. The experimental results indicate that speech emotion recognition using formants and bandwidths was better modelled using decision trees, which gave higher emotion recognition accuracies compared to logistic regression. The highest accuracy obtained using decision tree was 93.63%, for the classification of positive valence emotional speech as surprised or happy, using seven features. When using logistic regression for the same binary classification, the highest accuracy obtained was 73%, with eight features.
机译:在过去的几十年中,语音情感识别一直是语音处理中有趣的问题之一。情感识别过程的建模有助于理解和评估系统的性能。本文比较了使用声道特征的两种不同的语音情感识别模型,即前四个共振峰及其各自的带宽。第一个模型基于决策树,第二个模型采用逻辑回归。决策树模型基于机器学习,而回归模型具有强大的统计基础。通过在10个说话者的2800个语音文件的马拉雅拉姆语情感语音数据库上进行的语音情感识别实验,验证了这项工作针对几种二元分类情况开发的逻辑回归模型和决策树模型。这些模型不仅简单,而且有意义,因为它们表明了每个预测变量的贡献。实验结果表明,使用决策树更好地建模了使用共振峰和带宽的语音情感识别,与逻辑回归相比,该算法具有更高的情感识别准确性。使用决策树将正价情感语音分类为惊讶或高兴时,使用决策树获得的最高准确性为93.63%。当对相同的二元分类使用逻辑回归时,获得的最高准确性为73%,具有八个功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号