...
首页> 外文期刊>Malaysian Journal of Computer Science >Recognition Of Emotion In Speech Using Variogram Based Features
【24h】

Recognition Of Emotion In Speech Using Variogram Based Features

机译:使用基于变异函数的语音识别语音

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Speech Emotion Recognition (SER) is a relatively new and challenging branch in speech processing area. In this study, we propose new features derived from speech spectrogram using image processing techniques for emotion recognition. For this purpose, variogram graphs are calculated from speech spectrogram. The significant Discrete Cosine Transform (DCT) coefficients of variogram are used as proposed features. The contribution of these features as a complementary for the widely used prosodic and spectral features is also investigated. The feature selection is performed using Fisher Discriminant Ratio (FDR) filtering method. Finally, a linear Support Vector Machine (SVM) classifier is employed. All results are achieved under the 10 fold cross-validation on the Berlin and PDREC speech databases. Our results show that combining the proposed features with prosodic and spectral features significantly improves the classification accuracy. For Berlin database, when the proposed features were added to the prosodic and spectral ones, the recognition rates were improved from 83.18% and 89.36% to 86.82% and 90.43% for females and males, respectively. Also, on the PDREC, combining the proposed features with the prosodic and spectral features improve the recognition rate of females and males by 3.72% and 0.27%, respectively. For this database, the best classification accuracy of 63.18% and 57.37% were obtained for females and males, respectively.
机译:语音情感识别(SER)是语音处理领域中一个相对较新且具有挑战性的分支。在这项研究中,我们提出了使用图像处理技术从语音频谱图派生出的新特征以进行情感识别。为此,从语音频谱图计算出变异函数图。变异函数的显着离散余弦变换(DCT)系数用作建议的特征。还研究了这些特征作为广泛使用的韵律和频谱特征的补充的贡献。使用Fisher判别比(FDR)滤波方法执行特征选择。最后,采用线性支持向量机(SVM)分类器。所有结果均在Berlin和PDREC语音数据库的10倍交叉验证下获得。我们的结果表明,将所提出的特征与韵律和频谱特征相结合可以显着提高分类的准确性。对于柏林数据库,将建议的特征添加到韵律和频谱特征后,女性和男性的识别率分别从83.18%和89.36%提高到86.82%和90.43%。此外,在PDREC上,将建议的特征与韵律和频谱特征相结合,分别将男女识别率提高了3.72%和0.27%。对于该数据库,女性和男性的最佳分类准确率分别为63.18%和57.37%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号