首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech
【2h】

Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech

机译:用于语音情感识别的堆叠泛化方法的分类器子集选择

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In this paper, a new supervised classification paradigm, called classifier subset selection for stacked generalization (CSS stacking), is presented to deal with speech emotion recognition. The new approach consists of an improvement of a bi-level multi-classifier system known as stacking generalization by means of an integration of an estimation of distribution algorithm (EDA) in the first layer to select the optimal subset from the standard base classifiers. The good performance of the proposed new paradigm was demonstrated over different configurations and datasets. First, several CSS stacking classifiers were constructed on the RekEmozio dataset, using some specific standard base classifiers and a total of 123 spectral, quality and prosodic features computed using in-house feature extraction algorithms. These initial CSS stacking classifiers were compared to other multi-classifier systems and the employed standard classifiers built on the same set of speech features. Then, new CSS stacking classifiers were built on RekEmozio using a different set of both acoustic parameters (extended version of the Geneva Minimalistic Acoustic Parameter Set (eGeMAPS)) and standard classifiers and employing the best meta-classifier of the initial experiments. The performance of these two CSS stacking classifiers was evaluated and compared. Finally, the new paradigm was tested on the well-known Berlin Emotional Speech database. We compared the performance of single, standard stacking and CSS stacking systems using the same parametrization of the second phase. All of the classifications were performed at the categorical level, including the six primary emotions plus the neutral one.
机译:本文提出了一种新的监督分类范式,称为用于分类泛化的分类器子集选择(CSS stacking),以处理语音情感识别。新方法包括对第一级多层分类器系统的改进,该系统通过在第一层集成分布估计算法(EDA)从标准基础分类器中选择最佳子集,从而称为堆栈泛化。在不同的配置和数据集上证明了所提出的新范例的良好性能。首先,在RekEmozio数据集上构造了几个CSS堆栈分类器,使用了一些特定的标准基础分类器,并使用内部特征提取算法计算了总共123个光谱,质量和韵律特征。将这些最初的CSS堆栈分类器与其他多分类器系统进行了比较,并采用了基于相同语音功能集的采用的标准分类器。然后,在RekEmozio上使用不同的声学参数集(日内瓦极简声学参数集(eGeMAPS)的扩展版本)和标准分类器构建了新的CSS堆叠分类器,并采用了初始实验的最佳元分类器。评估并比较了这两个CSS堆栈分类器的性能。最终,新范例在著名的柏林情感语音数据库上进行了测试。我们比较了第二阶段使用相同参数化的单个标准堆叠和CSS堆叠系统的性能。所有分类均在分类级别进行,包括六种主要情绪加上中性情绪。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号