...
首页> 外文期刊>Neurocomputing >Exploiting hierarchical visual features for visual question answering
【24h】

Exploiting hierarchical visual features for visual question answering

机译:利用分层视觉功能进行视觉问答

获取原文
获取原文并翻译 | 示例
           

摘要

Visual question answering (VQA) aims reasoning answers given a pair of textual question and image. Previous approaches for VQA use only the highest layer of a Convolutional Neural Network (CNN) for visual representation, which biases on object classification task. These object-categorization oriented features lose low-level semantics (attribute related questions), e.g., color, texture, and the number of instances. Consequently, conventional VQA methods are vulnerable to low-level semantic questions. On the other hand, low-level layer features retain the low-level semantics. Thus, we suggest that the low-level layer features are superior in low-level semantic questions, and justify it through our experiments. Furthermore, we propose a novel VQA model named Hierarchical Feature Network (HFnet), which exploits intermediate CNN layers to derive various semantics for VQA. In the answer reasoning stage, each hierarchical feature is combined with the attention map and multimodal pooled to consider both high and low level semantic questions. Our proposed model outperforms the existing methods. The qualitative experiments also demonstrate that our proposed HFnet is superior in reasoning attention regions. (C) 2019 Elsevier B.V. All rights reserved.
机译:视觉问题回答(VQA)的目标是给出一对文本问题和图像的推理答案。 VQA的先前方法仅使用卷积神经网络(CNN)的最高层进行视觉表示,这偏向于对象分类任务。这些面向对象分类的功能丢失了低级语义(属性相关的问题),例如颜色,纹理和实例数量。因此,常规的VQA方法容易受到底层语义问题的影响。另一方面,低层图层功能保留了低层语义。因此,我们建议在低层语义问题中低层特征是优越的,并通过我们的实验证明了这一点。此外,我们提出了一种新颖的VQA模型,称为层次特征网络(HFnet),该模型利用中间CNN层来导出VQA的各种语义。在答案推理阶段,将每个层次结构特征与注意力图和多模式合并在一起,以考虑高级语义和低级语义问题。我们提出的模型优于现有方法。定性实验还表明,我们提出的HFnet在推理关注区域方面具有优势。 (C)2019 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号