首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Modeling Relationships in Referential Expressions with Compositional Modular Networks
【24h】

Modeling Relationships in Referential Expressions with Compositional Modular Networks

机译:用组成模块化网络建模关系中的参照表达

获取原文

摘要

People often refer to entities in an image in terms of their relationships with other entities. For example, the black cat sitting under the table refers to both a black cat entity and its relationship with another table entity. Understanding these relationships is essential for interpreting and grounding such natural language expressions. Most prior work focuses on either grounding entire referential expressions holistically to one region, or localizing relationships based on a fixed set of categories. In this paper we instead present a modular deep architecture capable of analyzing referential expressions into their component parts, identifying entities and relationships mentioned in the input expression and grounding them all in the scene. We call this approach Compositional Modular Networks (CMNs): a novel architecture that learns linguistic analysis and visual inference end-to-end. Our approach is built around two types of neural modules that inspect local regions and pairwise interactions between regions. We evaluate CMNs on multiple referential expression datasets, outperforming state-of-the-art approaches on all tasks.
机译:人们经常在与其他实体的关系方面引用图像中的实体。例如,坐在桌面下的黑猫是指黑猫实体和与另一表实体的关系。了解这些关系对于解释和接地此类自然语音表现至关重要。大多数事先工作侧重于整体参考表达式到一个区域,或基于固定的类别集合定位关系。在本文中,我们介绍了一种模块化的深层架构,能够分析到它们的组成部分中的参考表达式,识别输入表达式中提到的实体和关系并将它们全部接地。我们称之为这种方法组成模块化网络(CMN):一种学习语言分析和视觉推断结束结束的新型架构。我们的方法是围绕两种类型的神经模块构成,检查当地区域和地区之间的成对相互作用。我们在多个引用表达式数据集上评估CMNS,在所有任务上表现出最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号