Affective Visual Question Answering Network


Abstract

Visual Question Answering (VQA) has recently attracted considerable attention from researchers in the trending field of deep learning. The need to improve VQA models by focusing on local regions of images has led to the development of various attention models. This paper proposes the Affective Visual Question Answering Network (AVQAN), an attention model that combines local image features, the question, and the mood detected in the specific image region to produce an affective answer from a preprocessed image dataset. The experimental results show that AVQAN enriches the analysis and understanding of images by adding affective information to the answer, while keeping accuracy within the range of recent ordinary VQA baseline models. The proposed model contributes significantly to the development of rapidly improving emotion-aware machines, which are becoming increasingly vital in everyday life.
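The abstract describes fusing question-guided attention over local image features with a per-region mood signal. The paper's architecture is not given here, so the following is only a minimal illustrative sketch of such a fusion, assuming simple dot-product attention; all names, shapes, and weights (`avqan_fuse`, `w_ans`, the concatenation step) are assumptions, not the authors' method.

```python
import numpy as np

def softmax(x):
    # numerically stable softmax over a 1-D score vector
    e = np.exp(x - x.max())
    return e / e.sum()

def avqan_fuse(region_feats, question_vec, mood_vec, w_ans):
    """Hypothetical fusion step: attend to image regions with the
    question, then combine the attended features with a mood vector.

    region_feats : (R, D) local features, one row per image region
    question_vec : (D,)   question embedding
    mood_vec     : (M,)   mood detected from the attended region
    w_ans        : (A, 2*D + M) answer classifier weights
    """
    # question-region similarity gives the attention distribution
    scores = region_feats @ question_vec          # (R,)
    alpha = softmax(scores)                       # attention weights, sum to 1
    attended = alpha @ region_feats               # (D,) weighted region summary
    # concatenate visual, question, and affective information
    fused = np.concatenate([attended, question_vec, mood_vec])
    logits = w_ans @ fused                        # (A,) answer scores
    return logits, alpha
```

In this sketch the mood vector is simply concatenated before classification; a real model would likely learn the mood detector and fusion jointly.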
