首页> 外文会议> >Intermediary Fuzzification in Speech Emotion Recognition
【24h】

Intermediary Fuzzification in Speech Emotion Recognition

机译:语音情感识别中的中间模糊化

获取原文

摘要

Affective systems are getting increasingly more attention from researchers and high-tech companies in order to enable the acknowledgment or adaptation to a user’s mood. Emotion classification is typically a hard problem due to the number of subtle cues which are present in human facial and body expressions, or in voiced utterances. Another critical factor is that typically used models tend to map emotions into all-or-nothing regions with artificially sharp divisions among them, a view which is rather unsupported in the field of psychology and human behavioral analysis. In this paper we propose the inclusion of an intermediary fuzzy layer in a VGGVox-based NN, whose aim is to deal with the inherently foggy transitions between emotional states. This neuro-fuzzy model was trained and evaluated against four emotional speech databases and has shown improvements in the classification performance over a non-fuzzy counterpart. Observed performances were also on-par or above those of other current state-of-the-art techniques.
机译:情感系统越来越受到研究人员和高科技公司的关注,以便能够确认或适应用户的情绪。由于存在于人的面部和身体表情或发声中的微妙线索的数量,情绪分类通常是一个难题。另一个关键因素是,通常使用的模型倾向于将情感映射到全有或全无的区域,其中人为地将其划分得很清晰,这一观点在心理学和人类行为分析领域中是相当缺乏支持的。在本文中,我们建议在基于VGGVox的NN中包括一个中间模糊层,其目的是处理情绪状态之间固有的模糊过渡。该神经模糊模型是针对四个情感语音数据库进行训练和评估的,与非模糊模型相比,该模型在分类性能上已有改进。观察到的性能也达到或超过了其他当前最新技术水平。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号