首页> 外文会议>Mobile Multimedia/Image Processing, Security, and Applications >Emotion recognition from speech - Tools and Challenges
【24h】

Emotion recognition from speech - Tools and Challenges

机译:言语 - 工具和挑战的情感认可

获取原文
获取外文期刊封面目录资料

摘要

Human emotion recognition from speech is studied frequently for its importance in many applications, e.g. human-computer interaction. There is a wide diversity and non-agreement about the basic emotion or emotion-related states on one hand and about where the emotion related information lies in the speech signal on the other side. These diversities motivate our investigations into extracting Meta-features using the PCA approach, or using a non-adaptive random projection RP, which significantly reduce the large dimensional speech feature vectors that may contain a wide range of emotion related information. Subsets of Meta-features are fused to increase the performance of the recognition model that adopts the score-based LDC classifier. We shall demonstrate that our scheme outperform the state of the art results when tested on non-prompted databases or acted databases (i.e. when subjects act specific emotions while uttering a sentence). However, the huge gap between accuracy rates achieved on the different types of datasets of speech raises questions about the way emotions modulate the speech. In particular we shall argue that emotion recognition from speech should not be dealt with as a classification problem. We shall demonstrate the presence of a spectrum of different emotions in the same speech portion especially in the non-prompted data sets, which tends to be more "natural" than the acted datasets where the subjects attempt to suppress all but one emotion.
机译:来自言语的人类情感识别在许多应用中经常研究其重要性,例如,在许多应用中进行重要性。人机交互。一方面,关于基本情绪或情感相关的州的广泛多样性和非协议,以及情绪相关信息在另一侧的语音信号中。这些多样性使我们的调查能够使用PCA方法来提取元特征,或者使用非自适应随机投影RP来提取元特征,这显着减少了可能包含各种情绪相关信息的大型语音特征向量。元特征的子集被融合,以提高采用基于分数的LDC分类器的识别模型的性能。我们将展示我们的方案在在非提示的数据库或Acted数据库上测试时,我们的方案优于现有技术的状态(即,当主题在发出句子时采取特殊情绪时)。然而,在不同类型的语音数据集上实现的精度率之间的巨大差距提出了关于情绪调制演讲的方式的问题。特别是,我们应该争辩说,不应作为分类问题处理言论的情绪认可。我们将展示在相同的语音部分中的存在谱的存在,特别是在非提示的数据集中,这往往比被动的数据集更加“自然”,其中受试者试图抑制所有的一种情绪。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号