RepeatPadding: Balancing words and sentence length for language comprehension in visual question answering


Abstract

Visual question answering (VQA) is a complicated Turing-AI task that requires not only understanding the multi-modal inputs but also reasoning to provide the correct answer. Popular recent works rely on complicated and sophisticated modules for reasoning. However, the language representation, which is frequently treated as the guide of VQA, has not been fully explored in current research, leading to insufficient reasoning and unsatisfactory answers. In this work, two methods that focus on language processing, VieAns and RepeatPadding, are proposed to balance the sentence by cropping and padding the question. The language information is thereby transformed into different expressions, which pushes the language model to capture more representative features and further boosts the accuracy of the predicted answers. Experiments on the benchmark COCO-QA and VQA2.0 datasets demonstrate the effectiveness of the proposed methods. In particular, the proposed RepeatPadding method is more suitable for different language models. (C) 2020 Elsevier Inc. All rights reserved.
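The abstract does not spell out how the question is cropped and padded, so the following is only a minimal sketch of one plausible reading: instead of filling a short question with a constant pad token, the tail is filled by cycling through the question's own words. The function names `zero_pad` and `repeat_pad`, the `<pad>` token, and the fixed length of 8 are illustrative assumptions, not the authors' implementation.

```python
from typing import List


def zero_pad(tokens: List[str], max_len: int, pad: str = "<pad>") -> List[str]:
    """Conventional baseline: crop to max_len, then fill the tail with a pad token."""
    cropped = tokens[:max_len]
    return cropped + [pad] * (max_len - len(cropped))


def repeat_pad(tokens: List[str], max_len: int) -> List[str]:
    """Hypothetical repeat padding (an assumption, not the paper's exact method):
    crop to max_len, then fill the tail by cycling through the question's words."""
    if not tokens:
        raise ValueError("cannot repeat-pad an empty question")
    cropped = tokens[:max_len]
    return [cropped[i % len(cropped)] for i in range(max_len)]


if __name__ == "__main__":
    question = "what color is the cat".split()
    print(zero_pad(question, 8))
    print(repeat_pad(question, 8))
```

Under this reading, every position in the fixed-length input carries a real word, so a recurrent language model never processes a run of uninformative pad tokens. Whether the paper repeats the whole question, single words, or something else is not stated in the abstract.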


Bibliographic details

Source: Information Sciences: An International Journal, 2020, issue 2020, 13 pages
Original format: PDF
Language: English
Chinese Library Classification: Automatic information theory; Computer applications; Information and knowledge dissemination; Automation and computer technology
Keywords: Visual question answering; Convolutional neural network; Recurrent neural network; Sentence transformation

