Natural Language Object Retrieval

Abstract

In this paper, we address the task of natural language object retrieval: localizing a target object within a given image based on a natural language query describing the object. Natural language object retrieval differs from the text-based image retrieval task because it involves spatial information about objects within the scene and global scene context. To address this issue, we propose a novel Spatial Context Recurrent ConvNet (SCRC) model as a scoring function on candidate boxes for object retrieval, integrating spatial configurations and global scene-level contextual information into the network. Our model processes query text, local image descriptors, spatial configurations and global context features through a recurrent network, outputs the probability of the query text conditioned on each candidate box as a score for the box, and can transfer visual-linguistic knowledge from the image captioning domain to our task. Experimental results demonstrate that our method effectively utilizes both local and global information, significantly outperforming previous baseline methods on different datasets and scenarios, and can exploit large-scale vision and language datasets for knowledge transfer.
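
The retrieval step described in the abstract (score each candidate box by the probability of the query text conditioned on that box, then pick the best box) can be illustrated with a minimal Python sketch. This is not the SCRC network: `toy_word_prob` is a hypothetical, hand-written stand-in for the recurrent model's per-word probability, and the function names, the box format `(xmin, ymin, xmax, ymax)` and the image width are assumptions made here for illustration only.

```python
import math
from typing import Callable, List, Sequence, Tuple

# Assumed box format for this sketch: (xmin, ymin, xmax, ymax) in pixels.
Box = Tuple[float, float, float, float]
# Assumed signature: P(word | previous words, box), returning a value in (0, 1].
WordProb = Callable[[Sequence[str], str, Box], float]


def query_log_prob(words: Sequence[str], box: Box, word_prob: WordProb) -> float:
    """Log-probability of the whole query given one box (chain rule over words)."""
    return sum(
        math.log(word_prob(words[:t], words[t], box)) for t in range(len(words))
    )


def retrieve(
    words: Sequence[str], boxes: Sequence[Box], word_prob: WordProb
) -> Tuple[int, List[float]]:
    """Score every candidate box and return the index of the best one plus all scores."""
    scores = [query_log_prob(words, box, word_prob) for box in boxes]
    best = max(range(len(boxes)), key=scores.__getitem__)
    return best, scores


def toy_word_prob(prefix: Sequence[str], word: str, box: Box) -> float:
    """Hypothetical stand-in for a learned model: only reacts to 'left' / 'right'."""
    image_width = 100.0  # assumed image width for the toy example
    on_left = (box[0] + box[2]) / 2 < image_width / 2
    if word == "left":
        return 0.9 if on_left else 0.1
    if word == "right":
        return 0.1 if on_left else 0.9
    return 0.5  # every other word is treated as uninformative


boxes = [(60.0, 10.0, 90.0, 40.0), (5.0, 10.0, 35.0, 40.0)]
best, scores = retrieve("chair on the left".split(), boxes, toy_word_prob)
print(best, scores)
```

In this toy setup the spatial word in the query is the only thing that separates the two boxes, which mirrors, in a very reduced form, why the abstract stresses spatial configuration as an input to the scoring function.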

Bibliographic record

Source
IEEE Conference on Computer Vision and Pattern Recognition | 2016 | pp. 4555-4564 | 10 pages
Authors
Ronghang Hu; Huazhe Xu; Marcus Rohrbach; Jiashi Feng; Kate Saenko; Trevor Darrell
Original format: PDF
Keywords
Natural languages; Context; Context modeling; Predictive models; Adaptation models; Feature extraction; Training
