Neural Networks: The Official Journal of the International Neural Network Society

A multimodal convolutional neuro-fuzzy network for emotion understanding of movie clips



Abstract

Multimodal emotion understanding enables AI systems to interpret human emotions. With the rapid surge in video data, emotion understanding remains challenging due to the inherent ambiguity of the data and the diversity of video content. Although deep learning has made considerable progress in feature learning from big data, deep models are typically deterministic, used in a "black-box" manner, and lack the capability to represent the inherent ambiguities in data. Since the possibility theory of fuzzy logic focuses on knowledge representation and reasoning under uncertainty, we incorporate concepts of fuzzy logic into the deep learning framework. This paper presents a novel convolutional neuro-fuzzy network, an integration of convolutional neural networks with the fuzzy logic domain, to extract high-level emotion features from the text, audio, and visual modalities. The feature sets extracted by the fuzzy convolutional layers are compared with those of convolutional neural networks at the same level using t-distributed Stochastic Neighbor Embedding (t-SNE). The paper demonstrates a multimodal emotion understanding framework built on an adaptive neuro-fuzzy inference system (ANFIS) that can generate new rules to classify emotions. For emotion understanding of movie clips, we concatenate the audio, visual, and text features extracted by the proposed convolutional neuro-fuzzy network to train the ANFIS. We then go one step further and explain how the deep model arrives at its conclusions, a step toward interpretable AI. To identify which visual/text/audio aspects are important for emotion understanding, we use the direct linear non-Gaussian acyclic model (DirectLiNGAM) to explain relevance in terms of causal relationships between features of the deep hidden layers. The critical features thus extracted are fed into the proposed multimodal framework to achieve higher accuracy. (C) 2019 Elsevier Ltd. All rights reserved.
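The abstract does not specify the internal design of the fuzzy convolutional layers, so the following is only a minimal sketch of one common construction: each input channel is fuzzified with learnable Gaussian membership functions, and a standard convolution then aggregates the resulting membership maps. The class name FuzzyConv2d, the number of membership functions n_mfs, and the Gaussian parameterization are assumptions for illustration, not details from the paper.

```python
import torch
import torch.nn as nn

class FuzzyConv2d(nn.Module):
    """Hypothetical fuzzy convolutional layer: fuzzify each input channel
    with n_mfs Gaussian membership functions, then convolve over the
    stacked membership maps."""
    def __init__(self, in_channels, out_channels, kernel_size, n_mfs=3):
        super().__init__()
        # Learnable centers/widths: one set of membership functions per channel.
        self.centers = nn.Parameter(torch.linspace(-1.0, 1.0, n_mfs).repeat(in_channels, 1))
        self.widths = nn.Parameter(torch.ones(in_channels, n_mfs))
        self.conv = nn.Conv2d(in_channels * n_mfs, out_channels,
                              kernel_size, padding=kernel_size // 2)

    def forward(self, x):                       # x: (batch, C, H, W)
        b, c, h, w = x.shape
        xe = x.unsqueeze(2)                     # (batch, C, 1, H, W)
        # Gaussian membership degrees of every pixel, per membership function.
        mu = torch.exp(-((xe - self.centers.view(1, c, -1, 1, 1)) ** 2)
                       / (2 * self.widths.view(1, c, -1, 1, 1) ** 2 + 1e-6))
        return self.conv(mu.reshape(b, -1, h, w))

# Usage: FuzzyConv2d(3, 16, 3)(torch.randn(4, 3, 32, 32)) -> (4, 16, 32, 32)
```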
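The t-SNE comparison between the two feature sets can be sketched with scikit-learn as below; fuzzy_feats and cnn_feats are placeholder arrays standing in for activations collected at the same layer depth, not variables from the paper.

```python
import numpy as np
from sklearn.manifold import TSNE

# Placeholder features: (n_samples, n_dims) activations from the fuzzy
# convolutional layer and from a plain CNN layer at the same depth.
fuzzy_feats = np.random.rand(500, 128)
cnn_feats = np.random.rand(500, 128)

emb_fuzzy = TSNE(n_components=2, perplexity=30).fit_transform(fuzzy_feats)
emb_cnn = TSNE(n_components=2, perplexity=30).fit_transform(cnn_feats)
# Plotting emb_fuzzy and emb_cnn side by side, colored by emotion label,
# shows how well each feature set separates the classes.
```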
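The abstract states that the per-modality features are concatenated and used to train an ANFIS. A compact first-order Sugeno-style forward pass might look like the sketch below; the Gaussian memberships, product t-norm, and all dimensions are assumptions, and the paper's rule-generation mechanism is not reproduced here.

```python
import torch
import torch.nn as nn

class TinyANFIS(nn.Module):
    """First-order Sugeno ANFIS sketch: Gaussian memberships, product
    firing strengths (computed in log space), rule-wise linear consequents."""
    def __init__(self, n_inputs, n_rules, n_classes):
        super().__init__()
        self.centers = nn.Parameter(torch.randn(n_rules, n_inputs))
        self.widths = nn.Parameter(torch.ones(n_rules, n_inputs))
        self.consequents = nn.Linear(n_inputs, n_rules * n_classes)
        self.n_rules, self.n_classes = n_rules, n_classes

    def forward(self, x):                                  # x: (batch, n_inputs)
        d = (x.unsqueeze(1) - self.centers) / (self.widths.abs() + 1e-6)
        log_firing = -0.5 * (d ** 2).sum(dim=-1)           # log-product of Gaussians
        w = torch.softmax(log_firing, dim=-1)              # normalized firing strengths
        y = self.consequents(x).view(-1, self.n_rules, self.n_classes)
        return (w.unsqueeze(-1) * y).sum(dim=1)            # (batch, n_classes)

# Concatenate per-modality features (dimensions are illustrative only).
audio_f, visual_f, text_f = torch.randn(8, 64), torch.randn(8, 64), torch.randn(8, 32)
fused = torch.cat([audio_f, visual_f, text_f], dim=1)      # (8, 160)
logits = TinyANFIS(n_inputs=160, n_rules=16, n_classes=7)(fused)
```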
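Finally, the causal analysis of hidden-layer features can be reproduced with the open-source lingam package, which implements DirectLiNGAM; the feature matrix below is a placeholder, and the abstract does not say whether the authors used this particular implementation.

```python
import numpy as np
import lingam   # pip install lingam

# Placeholder: deep hidden-layer activations, shape (n_samples, n_features).
hidden_features = np.random.rand(1000, 10)

model = lingam.DirectLiNGAM()
model.fit(hidden_features)
# adjacency_matrix_[i, j] is the estimated causal effect of feature j on
# feature i; strongly connected features are candidates for the "critical"
# inputs fed back into the multimodal framework.
print(model.adjacency_matrix_)
```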

