Affective question answering on video

Ruwa Nelson; Mao Qirong; Wang Liangjun; Gou Jianping

首页> 外文期刊>Neurocomputing >Affective question answering on video

【24h】

Affective question answering on video

机译：视频上的情感问答

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Visual Question Answering (VQA) is an increasingly popular research area in machine learning. Most of the existing VQA tasks only focus on static images, and only a few models are based on videos. The primary purpose of this project is to develop an innovative model that performs Affective Question Answering on Video (AQAV), a multi-tasking architecture that implements a Video QA route and an Affective route. A pre-trained CNN emotion detector recognizes emotions on the frames of a video, and a string of the emotion labels is relayed to the Token-based, Frame-based and Integrated attention mechanisms. The attention model uses the visual features, the question and the emotion labels to focus on relevant frames of the video and relevant regions of the frames. The string of emotion labels is used to generate an emotion caption that will be used by the Text QA module to prepare an affective answer. A conventional answer is generated from processes that take place along the Video QA route, while the affective answer is a product of both the Video QA and the Affective routes. Our model does not only make VQA more analytic by generating an explanatory answer, but also registers quantitative improvement in performance, when compared with previous baselines. We managed to prove that the injection of emotions in the attention mechanism boosts VQA performance. The AQAV model contributes towards efforts in making machines understand sequential and dynamic visual scenes in the real world. (C) 2019 Published by Elsevier B.V.

机译：视觉问答（VQA）是机器学习中越来越受欢迎的研究领域。现有的大多数VQA任务仅关注静态图像，只有很少的模型基于视频。该项目的主要目的是开发一种创新的模型，该模型执行视频情感问题解答（AQAV），该多任务体系结构实现了视频质量检查路线和情感路线。预先训练的CNN情感检测器可以识别视频帧上的情感，并将一串情感标签中继到基于令牌，基于帧和集成的注意力机制。注意力模型使用视觉特征，问题和情感标签来关注视频的相关帧和帧的相关区域。情感标签字符串用于生成情感标题，文本QA模块将使用该标题来准备情感回答。传统答案是从视频质量检查路线中发生的过程中生成的，而情感性回答是视频质量检查和情感路线的乘积。与以前的基准相比，我们的模型不仅可以通过生成解释性答案来使VQA更具分析性，而且还可以实现性能的定量提高。我们设法证明，在注意力机制中注入情绪可以提高VQA的表现。 AQAV模型有助于使机器了解现实世界中的顺序和动态视觉场景。（C）2019由Elsevier B.V.发布

著录项

来源
《Neurocomputing》 |2019年第21期|125-139|共15页
作者
Ruwa Nelson; Mao Qirong; Wang Liangjun; Gou Jianping;
展开▼
作者单位

Jiangsu Univ Sch Comp Sci Zhenjiang Jiangsu Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Video question answering; Emotion detection; Video captioning; Multi-task learning;

机译：视频问答;情绪检测;视频字幕;多任务学习;

相似文献

外文文献
中文文献
专利

1. Unifying the Video and Question Attentions for Open-Ended Video Question Answering [J] . Hongyang Xue, Zhou Zhao, Deng Cai IEEE Transactions on Image Processing . 2017,第12期

机译：统一开放式视频问答的视频和问题注意
2. Remember and forget: video and text fusion for video question answering [J] . Feng Gao, Yuanyuan Ge, Yongge Liu Multimedia Tools and Applications . 2018,第22期

机译：记住和忘记：视频和文本融合，用于视频问答
3. Quantitative Analysis of the Usage of a Pedagogical Tool Combining Questions Listed as Learning Objectives and Answers Provided as Online Videos [J] . Dorota Sikora, Odette Laneuville Future Internet . 2015,第2期

机译：结合作为学习目标列出的问题和作为在线视频提供的答案的教学工具的使用情况的定量分析
4. Divide and Conquer: Question-Guided Spatio-Temporal Contextual Attention for Video Question Answering [C] . Jianwen Jiang, Ziqiang Chen, Haojie Lin, AAAI Conference on Artificial Intelligence . 2020

机译：划分和征服：视频问题回答的问候时空语境注意
5. Automatic Neural Question Generation Using Community-Based Question Answering Systems [D] . Baghaee, Tina. 2018

机译：使用基于社区的问题应答系统的自动神经问题
6. The most important questions in cancer research and clinical oncology—Question 2–5. Obesity-related cancers: more questions than answers [O] . Ajit Venniyoor 2017

机译：癌症研究和临床肿瘤学中最重要的问题-问题2–5。肥胖相关癌症：问题多于答案
7. Analysis of the factors of correct and non-responses answers of Japanese language openended questions in elementary school (3) : Does their study habits and the working memory affect to correct and non-responses answers interactively? [O] . 糸満裕, 道田泰司, Itoman Yutaka, 2016

机译：小学日语开放式问题的正确和不回答答案的因素分析（3）：他们的学习习惯和工作记忆是否会影响交互式正确和不回答的答案？
8. First Steps Toward Linking Dialogues: Mediating Between Free-text Questions and Pre-recorded Video Answers [R] . Gandhe, S. , Gordon, A. , Leuski, A. , 2004

机译：连接对话的第一步：在自由文本问题和预先录制的视频答案之间进行调解

Affective question answering on video

摘要

著录项

相似文献

相关主题

期刊订阅