Data Augmentation Techniques for the Video Question Answering Task

机译：视频问题应答任务的数据增强技术

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Video Question Answering (VideoQA) is a task that requires a model to analyze and understand both the visual content given by the input video and the textual part given by the question, and the interaction between them in order to produce a meaningful answer. In our work we focus on the Egocentric VideoQA task, which exploits first-person videos, because of the importance of such task which can have impact on many different fields, such as those pertaining the social assistance and the industrial training. Recently, an Egocentric VideoQA dataset, called EgoVQA, has been released. Given its small size, models tend to overfit quickly. To alleviate this problem, we propose several augmentation techniques which give us a +5.5% improvement on the final accuracy over the considered baseline.

机译：视频问题应答（VideoQA）是一个任务，需要模型来分析和理解输入视频和问题给出的文本部分的视觉内容，以及它们之间的交互，以便产生有意义的答案。在我们的工作中，我们专注于Egocentric VideoQA任务，该任务利用第一人称视频，因为此类任务的重要性可能会对许多不同的领域产生影响，例如有关社会援助和工业培训。最近，已释放了一个名为EGOVQA的Egocentric VideoQA数据集。鉴于其体积小，型号往往会迅速过度。为了缓解这一问题，我们提出了几种增强技术，使我们在考虑基线上的最终准确性提高了5.5％。

著录项

来源
《European conference on computer vision》|2020年|511-525|共15页
会议地点
作者
Alex Falcon; Oswald Lanz; Giuseppe Serra;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Vision and language; Video Question Answering; Egocentric vision; Data augmentation;

机译：愿景和语言;视频问题应答;Egocentric Vision;数据增强;

相似文献

外文文献
中文文献
专利

1. Unifying the Video and Question Attentions for Open-Ended Video Question Answering [J] . Hongyang Xue, Zhou Zhao, Deng Cai IEEE Transactions on Image Processing . 2017,第12期

机译：统一开放式视频问答的视频和问题注意
2. QUESTION ANSWERING SYSTEM : A REVIEW ON QUESTION ANALYSIS, DOCUMENT PROCESSING, AND ANSWER EXTRACTION TECHNIQUES [J] . FANDY SETYO UTOMO, NANNA SURYANA, MOHD SANUSI AZMI Journal of Theoretical and Applied Information Technology . 2017,第14期

机译：问题应答系统：问题分析，文档处理和回答提取技术综述
3. IMPROVING ARABIC QUESTION ANSWERING SYSTEM BY MERGING ANER TECHNIQUE, UPDATED QUESTION CLASSIFICATION TECHNIQUE AND STOP WORDS TECHNIQUE [J] . BELAL ABUATA, IYAD ALAZZAM, FADI ALKHAZALEH Journal of Theoretical and Applied Information Technology . 2020,第23期

机译：通过合并AER技术，更新的问题分类技术和停止单词技术改进阿拉伯问题应答系统
4. An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering [C] . Shayne Longpre, Yi Lu, Zhucheng Tu, Workshop on machine reading for question answering . 2019

机译：领域无关问题回答的数据扩充和采样技术的探索
5. Neural Network Models for Tasks in Open-domain and Closed-domain Question Answering [D] . ?Chen, Charles L. 2020

机译：神经网络模型在开放领域和闭域问答系统任务
6. Differential Data Augmentation Techniques for Medical Imaging Classification Tasks [O] . Zeshan Hussain, Francisco Gimenez, Darvin Yi, 2017

机译：用于医学影像分类任务的差分数据增强技术
7. Data Augmentation Techniques for the Video Question Answering Task [O] . Alex Falcon, Oswald Lanz, Giuseppe Serra 2020

机译：视频问题应答任务的数据增强技术
8. First Steps Toward Linking Dialogues: Mediating Between Free-text Questions and Pre-recorded Video Answers [R] . Gandhe, S. , Gordon, A. , Leuski, A. , 2004

机译：连接对话的第一步：在自由文本问题和预先录制的视频答案之间进行调解

Data Augmentation Techniques for the Video Question Answering Task

摘要

著录项

相似文献

相关主题

期刊订阅