Modeling Relationships in Referential Expressions with Compositional Modular Networks

机译：使用组合模块化网络对引用表达式中的关系进行建模

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

People often refer to entities in an image in terms of their relationships with other entities. For example, the black cat sitting under the table refers to both a black cat entity and its relationship with another table entity. Understanding these relationships is essential for interpreting and grounding such natural language expressions. Most prior work focuses on either grounding entire referential expressions holistically to one region, or localizing relationships based on a fixed set of categories. In this paper we instead present a modular deep architecture capable of analyzing referential expressions into their component parts, identifying entities and relationships mentioned in the input expression and grounding them all in the scene. We call this approach Compositional Modular Networks (CMNs): a novel architecture that learns linguistic analysis and visual inference end-to-end. Our approach is built around two types of neural modules that inspect local regions and pairwise interactions between regions. We evaluate CMNs on multiple referential expression datasets, outperforming state-of-the-art approaches on all tasks.

机译：人们经常根据与其他实体的关系来指代图像中的实体。例如，坐在桌子下面的黑猫既指黑猫实体，也指它与另一个表实体的关系。理解这些关系对于解释和扎实自然语言表达至关重要。先前的大多数工作都集中于将整个参照表达全部基于一个区域，或者基于一组固定的类别来定位关系。相反，在本文中，我们提出了一种模块化的深层体系结构，该体系结构能够将引用表达式分析成它们的组成部分，识别输入表达式中提到的实体和关系，并将它们全部扎根在场景中。我们称这种方法为“组合模块化网络（CMN）”：一种新颖的体系结构，可端到端学习语言分析和视觉推理。我们的方法是基于两种类型的神经模块构建的，它们可以检查局部区域以及区域之间的成对交互。我们在多个引用表达数据集上评估CMN，在所有任务上均优于最新方法。

著录项

来源
《IEEE Conference on Computer Vision and Pattern Recognition》|2017年|4418-4427|共10页
会议地点
作者
Ronghang Hu; Marcus Rohrbach; Jacob Andreas; Trevor Darrell; Kate Saenko;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Visualization; Grounding; Feature extraction; Natural languages; Computer vision; Cats;

机译：可视化;接地;特征提取;自然语言;计算机视觉;猫;

相似文献

外文文献
中文文献
专利

1. Modular rule base fuzzy networks for linguistic composition based modelling [J] . Alexander Gegov, Nedyalko Petrov, David Sanders, International journal of knowledge-based and intelligent engineering systems . 2017,第2期

机译：基于模块化规则的模糊网络，用于基于语言组合的建模
2. Linguistic composition based modelling by fuzzy networks with modular rule bases [J] . Gegov Alexander, Arabikhan Farzad, Petrov Nedyalko Fuzzy sets and systems . 2015,第juna15期

机译：具有模块化规则库的基于模糊网络的语言组合建模
3. Mathematical modeling in genetic networks: relationships between the genetic expression and both chromosomic breakage and positive circuits [J] . Aracena J., Lamine S.B., Mermet M.A., IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics . 2003,第5期

机译：遗传网络中的数学建模：遗传表达与染色体断裂和正电路之间的关系
4. Modeling Relationships in Referential Expressions with Compositional Modular Networks [C] . Ronghang Hu, Marcus Rohrbach, Jacob Andreas, IEEE Conference on Computer Vision and Pattern Recognition . 2017

机译：用组成模块化网络建模关系中的参照表达
5. Modularizing backpropagation neural networks for multisource spatial data modeling and classification. [D] . Wang, Yeqiao. 1995

机译：模块化的反向传播神经网络，用于多源空间数据建模和分类。
6. Relationships between probabilistic Boolean networks and dynamic Bayesian networks as models of gene regulatory networks [O] . Harri Lähdesmäki, Sampsa Hautaniemi, Ilya Shmulevich, -1

机译：概率布尔网络与动态贝叶斯网络之间的关系作为基因调控网络的模型
7. Modeling Relationships in Referential Expressions with Compositional Modular Networks [O] . Hu, Ronghang, Rohrbach, Marcus, Andreas, Jacob, 2016

机译：参照表达式与组合关系的建模模块化网络

Modeling Relationships in Referential Expressions with Compositional Modular Networks

摘要

著录项

相似文献

相关主题

期刊订阅