...
首页> 外文期刊>Multimedia Tools and Applications >Continuous affect recognition with weakly supervised learning
【24h】

Continuous affect recognition with weakly supervised learning

机译:通过弱监督学习来持续识别情感

获取原文
获取原文并翻译 | 示例

摘要

Recognizing a person's affective state from audio-visual signals is an essential capability for intelligent interaction. Insufficient training data and the unreliable labels of affective dimensions (e.g., valence and arousal) are two major challenges in continuous affect recognition. In this paper, we propose a weakly supervised learning approach based on hybrid deep neural network and bidirectional long short-term memory recurrent neural network (DNN-BLSTM). It firstly maps the audio/visual features into a more discriminative space via the powerful modelling capacities of DNN, then models the temporal dynamics of affect via BLSTM. To reduce the negative impact of the unreliable labels, we utilize a temporal label (TL) along with a robust loss function (RL) for incorporating weak supervision into the learning process of the DNN-BLSTM model. Therefore, the proposed method not only has a simpler structure than the deep BLSTM model in He et al. (24) which requires more training data, but also is robust to noisy and unreliable labels. Single modal and multimodal affect recognition experiments have been carried out on the RECOLA dataset. Single modal recognition results show that the proposed method with TL and RL obtains remarkable improvements on both arousal and valence in terms of concordance correlation coefficient (CCC), while multimodal recognition results show that with less feature streams, our proposed approach obtains better or comparable results with the state-of-the-art methods.
机译:从视听信号识别人的情感状态是智能交互的一项基本功能。训练数据不足和情感维度的标签不可靠(例如,效价和唤醒)是持续进行情感识别的两个主要挑战。本文提出了一种基于混合深度神经网络和双向长短期记忆递归神经网络(DNN-BLSTM)的弱监督学习方法。它首先通过DNN强大的建模功能将音频/视频特征映射到更具区分性的空间,然后通过BLSTM对情感的时间动态进行建模。为了减少不可靠标签的负面影响,我们利用时间标签(TL)和鲁棒损失函数(RL)将弱监督纳入DNN-BLSTM模型的学习过程中。因此,所提出的方法不仅比He等人的深度BLSTM模型具有更简单的结构。 (24)不仅需要更多的训练数据,而且对于嘈杂和不可靠的标签也很可靠。对RECOLA数据集进行了单模态和多模态影响识别实验。单模态识别结果表明,本文提出的TL和RL方法在一致性和相关系数(CCC)方面在唤醒和价数方面均取得了显着改进,而多模态识别结果表明,特征流较少时,我们的方法可获得更好或可比的结果使用最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号