首页> 外文期刊>Computer speech and language >Class-specific multiple classifiers scheme to recognize emotions from speech signals
【24h】

Class-specific multiple classifiers scheme to recognize emotions from speech signals

机译:特定于类别的多重分类器方案,可从语音信号中识别情绪

获取原文
获取原文并翻译 | 示例

摘要

Automatic emotion recognition from speech signals is one of the important research areas, which adds value to machine intelligence. Pitch, duration, energy and Mel-frequency cepstral coefficients (MFCC) are the widely used features in the field of speech emotion recognition. A single classifier or a combination of classifiers is used to recognize emotions from the input features. The present work investigates the performance of the features of Autoregressive (AR) parameters, which include gain and reflection coefficients, in addition to the traditional linear prediction coefficients (LPC), to recognize emotions from speech signals. The classification performance of the features of AR parameters is studied using discriminant, it-nearest neighbor (KNN), Gaussian mixture model (GMM), back propagation artificial neural network (ANN) and support vector machine (SVM) classifiers and we find that the features of reflection coefficients recognize emotions better than the LPC. To improve the emotion recognition accuracy, we propose a class-specific multiple classifiers scheme, which is designed by multiple parallel classifiers, each of which is optimized to a class. Each classifier for an emotional class is built by a feature identified from a pool of features and a classifier identified from a pool of classifiers that optimize the recognition of the particular emotion. The outputs of the classifiers are combined by a decision level fusion technique. The experimental results show that the proposed scheme improves the emotion recognition accuracy. Further improvement in recognition accuracy is obtained when the scheme is built by including MFCC features in the pool of features.
机译:来自语音信号的自动情感识别是重要的研究领域之一,它为机器智能增加了价值。音调,持续时间,能量和梅尔频率倒谱系数(MFCC)是语音情感识别领域中广泛使用的功能。单个分类器或分类器的组合用于从输入特征中识别情绪。本工作研究了自回归(AR)参数的特征的性能,这些参数除了包括传统的线性预测系数(LPC)之外,还包括增益和反射系数,以识别语音信号中的情绪。利用判别式,最近邻(KNN),高斯混合模型(GMM),反向传播人工神经网络(ANN)和支持向量机(SVM)分类器研究了AR参数特征的分类性能。反射系数的特征比LPC更好地识别情绪。为了提高情感识别的准确性,我们提出了一个特定于类别的多重分类器方案,该方案由多个并行分类器设计,每个分类器都针对一个类别进行了优化。情感类别的每个分类器都由从一组特征中标识的特征和从一组优化特定情感识别的分类器中标识的分类器构建。分类器的输出通过决策级融合技术进行组合。实验结果表明,该方案提高了情感识别的准确性。通过在功能池中包括MFCC功能来构建该方案时,可以进一步提高识别精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号