首页> 外文会议>Asian Conference on Computer Vision >PIRC Net: Using Proposal Indexing, Relationships and Context for Phrase Grounding
【24h】

PIRC Net: Using Proposal Indexing, Relationships and Context for Phrase Grounding

机译:PIRC Net:使用提案索引,关系和上下文进行词组定位

获取原文

摘要

Phrase Grounding aims to detect and localize objects in images that are referred to and are queried by natural language phrases. Phrase grounding finds applications in tasks such as Visual Dialog, Visual Search and Image-text co-reference resolution. In this paper, we present a framework that leverages information such as phrase category, relationships among neighboring phrases in a sentence and context to improve the performance of phrase grounding systems. We propose three modules: Proposal Indexing Network (PIN); Inter-phrase Regression Network (IRN) and Proposal Ranking Network (PRN) each of which analyze the region proposals of an image at increasing levels of detail by incorporating the above information. Also, in the absence of ground-truth spatial locations of the phrases (weakly-supervised), we propose knowledge transfer mechanisms that leverages the framework of PIN module. We demonstrate the effectiveness of our approach on the Flickr 30k Entities and ReferItGame datasets, for which we achieve improvements over state-of-the-art approaches in both supervised and weakly-supervised variants.
机译:短语接地旨在检测和定位自然语言短语引用并查询的图像中的对象。短语基础可在诸如可视对话框,可视搜索和图像文本共同引用解析之类的任务中找到应用程序。在本文中,我们提出了一个框架,该框架利用诸如短语类别,句子中相邻短语之间的关系以及上下文之类的信息来提高短语基础系统的性能。我们提出了三个模块:提案索引网络(PIN);短语间回归网络(IRN)和提案排名网络(PRN),它们各自通过合并以上信息来分析图像区域提案的详细程度不断提高。同样,在没有短语的地面真实位置(弱监督)的情况下,我们提出了利用PIN模块框架的知识转移机制。我们在Flickr 30k实体和ReferItGame数据集上证明了我们的方法的有效性,为此,我们在有监督和无监督变体中都实现了对最新方法的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号