Iterative Visual Reasoning Beyond Convolutions

Abstract

We present a novel framework for iterative visual reasoning. Our framework goes beyond current recognition systems, which lack the capability to reason beyond a stack of convolutions. The framework consists of two core modules: a local module that uses spatial memory [4] to store previous beliefs with parallel updates, and a global graph-reasoning module. Our graph module has three components: (a) a knowledge graph, where we represent classes as nodes and build edges to encode different types of semantic relationships between them; (b) a region graph of the current image, where regions in the image are nodes and spatial relationships between these regions are edges; and (c) an assignment graph that assigns regions to classes. Both the local module and the global module roll out iteratively and cross-feed predictions to each other to refine estimates. The final predictions are made by combining the best of both modules with an attention mechanism. We show strong performance over plain ConvNets, e.g., achieving an 8.4% absolute improvement on ADE [55] as measured by per-class average precision. Analysis also shows that the framework is resilient to missing regions for reasoning.
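
The abstract gives no formulas, so the following is only a toy sketch of two ideas it names: passing evidence from regions to classes, along knowledge-graph edges, and back to regions; and merging the outputs of several modules with attention weights. It is not the authors' implementation. The function names (`graph_reasoning_step`, `combine_with_attention`), the use of one scalar belief per region, and the softmax over attention scores are all assumptions made for illustration.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vec):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in matrix]


def graph_reasoning_step(region_scores, assignment, class_edges):
    """One hypothetical round of region -> class -> class -> region passing.

    region_scores: one scalar belief per region (toy stand-in for features)
    assignment:    assignment[c][r] = weight linking region r to class c
    class_edges:   class_edges[c][k] = knowledge-graph edge weight from k to c
    """
    # Regions send evidence to class nodes through the assignment graph.
    class_in = matvec(assignment, region_scores)
    # Class nodes exchange evidence along knowledge-graph edges.
    class_out = matvec(class_edges, class_in)
    # Transpose the assignment graph to send class evidence back to regions.
    back = [[assignment[c][r] for c in range(len(assignment))]
            for r in range(len(region_scores))]
    return matvec(back, class_out)


def combine_with_attention(logits_per_module, attention_per_module):
    """Attention-weighted sum of per-class logits from several modules."""
    weights = softmax(attention_per_module)
    n_classes = len(logits_per_module[0])
    return [sum(w * logits[c] for w, logits in zip(weights, logits_per_module))
            for c in range(n_classes)]
```

In this sketch, calling `combine_with_attention` with equal attention scores averages the modules' logits, while a much larger score for one module lets its prediction dominate. An iterative version would feed each round's combined output back into both modules, as the abstract describes at a high level.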

Bibliographic record

Source: IEEE/CVF Conference on Computer Vision and Pattern Recognition | 2018 | pp. 7239-7248 | 10 pages
Conference location: Salt Lake City (US)
Authors: Xinlei Chen; Li-Jia Li; Li Fei-Fei; Abhinav Gupta
Original format: PDF
Keywords: Cognition; Semantics; Visualization; Image edge detection; Knowledge based systems; Computer architecture; Automobiles
Indexed: 2022-08-26 14:35:32

Similar literature

1. Intelligent Visual Reasoning Tutor: an Intelligent Tutoring System for Visual Reasoning in Engineering & Architecture [J]. Yong Se Kim, Eric Wang. The International Journal of Engineering Education. 2009, No. 4
2. New Interpretation to Variational Iteration Method: Convolution Iteration Method Based on Duhamel's Principle for Dynamic System Analysis [J]. Yunhua Li, Yunze Li, Chieh-Li Chen. Computer Modeling in Engineering & Sciences. 2010, No. 1
3. Reasoning about time from space: Visual continuity may disrupt reasoning about the passage of time within accreted materials [J]. Rachel A. Myer, Thomas F. Shipley, Alexandra K. Davatzes. Journal of Geoscience Education. 2018, No. 2
4. Iterative Visual Reasoning Beyond Convolutions [C]. Xinlei Chen, Li-Jia Li, Li Fei-Fei. IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2018
5. Effective and Efficient Convolutional Architectures for Visual Recognition [D]. Ma, Ningning. 2021
6. Robust Model-Free Adaptive Iterative Learning Control for Vibration Suppression Based on Evidential Reasoning [O]. Liang Bai, Yun-Wen Feng, Ning Li. 2016
7. Iterative Visual Reasoning Beyond Convolutions [O]. Xinlei Chen, Li-Jia Li, Li Fei-Fei. 2018
8. Iterative and combined trellis decoding of GMSK and FQPSK with short constraint length convolutional codes [R]. Lee, D., Lee, T.-Y. 2001
