Instance-sequence reasoning for video question answering

Rui LIU; Yahong HAN

首页> 中文期刊> 《中国计算机科学前沿：英文版》 >Instance-sequence reasoning for video question answering

Instance-sequence reasoning for video question answering

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Video question answering(Video QA)involves a thorough understanding of video content and question language,as well as the grounding of the textual semantic to the visual content of videos.Thus,to answer the questions more accurately,not only the semantic entity should be associated with certain visual instance in video frames,but also the action or event in the question should be localized to a corresponding temporal slot.It turns out to be a more challenging task that requires the ability of conducting reasoning with correlations between instances along temporal frames.In this paper,we propose an instance-sequence reasoning network for video question answering with instance grounding and temporal localization.In our model,both visual instances and textual representations are firstly embedded into graph nodes,which benefits the integration of intra-and inter-modality.Then,we propose graph causal convolution(GCC)on graph-structured sequence with a large receptive field to capture more causal connections,which is vital for visual grounding and instance-sequence reasoning.Finally,we evaluate our model on TVQA+dataset,which contains the groundtruth of instance grounding and temporal localization,three other Video QA datasets and three multimodal language processing datasets.Extensive experiments demonstrate the effectiveness and generalization of the proposed method.Specifically,our method outperforms the state-of-the-art methods on these benchmarks.

著录项

来源
《中国计算机科学前沿：英文版》 |2022年第6期|93-101|共9页
作者
Rui LIU; Yahong HAN;
展开▼
作者单位

College of Intelligence and Computing;

Tianjin University;

Tianjin;

300350;

China;

Tianjin Key Lab of Machine Learning;

Tianjin University;

Tianjin;

300350;

China;

展开▼
原文格式 PDF
正文语种 chi
中图分类计算技术、计算机技术;
关键词
video question answering; instance grounding; graph causal convolution;

相似文献

中文文献
外文文献
专利

Instance-sequence reasoning for video question answering

摘要

著录项

相似文献

相关主题

期刊订阅