首页> 外文会议>International Conference on Information Communication and Embedded Systems >Analysis of windowing techniques for speech emotion recognition
【24h】

Analysis of windowing techniques for speech emotion recognition

机译:语音情感识别的开窗技术分析

获取原文

摘要

Conventionally, the spectral features are derived from the DFT spectrum using the Hamming window. The spectral leakage is reduced by windowing but the variance of the spectral estimate is high. Multitaper method emphasizes on using multiple windows and frequency domain averaging. In this paper we study the impact of introduction of multitapering on the performance of Speech Emotion Recognition. Various spectral features including MFCCs are taken into consideration while the classifier used is Support Vector Machine (SVM). For the spectral features, in case of multitapering an improvement of upto 2% was found as compared to traditional Hamming window when tested on Berlin database. Impact of variable frame size, different windows and variable taper number is also studied.
机译:通常,使用汉明窗从DFT光谱中得出光谱特征。通过开窗减少了光谱泄漏,但是光谱估计值的方差很高。 Multitaper方法强调使用多个窗口和频域平均。在本文中,我们研究了引入多锥度对语音情感识别性能的影响。当使用的分类器是支持向量机(SVM)时,考虑了包括MFCC在内的各种光谱特征。对于光谱特征,在柏林数据库上进行测试时,与传统的汉明窗相比,在多锥度的情况下,可提高2%的性能。还研究了可变框架尺寸,不同窗口和可变锥度的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号