【24h】

Emotional Speech: A Spectral Analysis

机译:情感演讲:频谱分析

获取原文

摘要

Feature extraction and dimensionality reduction may be found as the most imperative parts of emotional speech recognition problem. In this work, we propose a new set of speech features based on the distribution of energy in frequency domain. To investigate the applicability of the proposed model, we have set the first international audio/visual emotion challenge (AVEC 2011) as a benchmark. As for the modeling and dimensionality reduction, we have employed the lasso. It is shown how 15 explicit spectral energy features, as suggested in this work, can lead to a more accurate model than those of the participants in the audio sub-challenge. This is while this number of features is less than ten percent of the smallest set of features in the challenge.
机译:特征提取和降维可能是情感语音识别问题中最重要的部分。在这项工作中,我们基于频域中的能量分布提出了一组新的语音特征。为了研究所提出模型的适用性,我们以第一个国际视听情感挑战赛(AVEC 2011)为基准。至于建模和降维,我们使用套索。它显示了这项工作中建议的15种显式频谱能量特征如何比音频子挑战中的参与者的模型更精确的模型。与此相比,此功能的数量还不足挑战中最小功能集的百分之十。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号