首页> 外文会议>First grand challenge and workshop on human multimodal language 2018 >Convolutional Attention Networks for Multimodal Emotion Recognition from Speech and Text Data
【24h】

Convolutional Attention Networks for Multimodal Emotion Recognition from Speech and Text Data

机译:卷积注意力网络用于语音和文本数据的多模态情感识别

获取原文
获取原文并翻译 | 示例

摘要

Emotion recognition has become a popular topic of interest, especially in the field of human computer interaction. Previous works involve unimodal analysis of emotion, while recent efforts focus on multi-modal emotion recognition from vision and speech. In this paper, we propose a new method of learning about the hidden representations between just speech and text data using convolutional attention networks. Compared to the shallow model which employs simple concatenation of feature vectors, the proposed attention model performs much better in classifying emotion from speech and text data contained in the CMU-MOSEI dataset.
机译:情绪识别已成为人们关注的热门话题,尤其是在人机交互领域。以前的工作涉及对情绪的单峰分析,而最近的工作则集中于从视觉和语音中识别多峰的情绪。在本文中,我们提出了一种使用卷积注意力网络来学习语音和文本数据之间的隐藏表示的新方法。与采用特征向量的简单连接的浅层模型相比,所提出的注意力模型在根据CMU-MOSEI数据集中包含的语音和文本数据对情感进行分类方面表现更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号