In this paper, we present a simple and effective framework for natural language object detection, which localizes a target within an image based on a natural language description of the target. The method, called Semantic R-CNN, extends the Region Proposal Network (RPN) [1] with an LSTM [20] module that processes the natural language query. The LSTM module takes the encoded query text and image descriptors as input and outputs the probability of the query conditioned on the visual features of each candidate box and of the whole image. The candidate boxes are generated by the RPN, and their local features are extracted by ROI pooling. The RPN can be initialized from a pre-trained Faster R-CNN model [1], transferring object-level visual knowledge from the traditional object detection domain to our task. Experimental results demonstrate that our method significantly outperforms the previous baseline SCRC (Spatial Context Recurrent ConvNet) model [7] on the ReferIt dataset [8]; moreover, our model is as simple to train as Faster R-CNN.
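The scoring scheme described above can be sketched as follows. This is a minimal illustrative toy, not the paper's implementation: it replaces the LSTM with a plain recurrent cell for brevity, uses random stand-ins for the RPN proposals, ROI-pooled features, and whole-image descriptor, and all dimensions and parameter names are hypothetical. The key idea it shows is conditioning a recurrent language model on concatenated local and global visual features, then selecting the candidate box under which the query is most probable.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy dimensions (hypothetical; chosen only for illustration).
VOCAB, EMBED, HIDDEN, ROI_DIM = 10, 8, 16, 12

# Hypothetical "learned" parameters, randomly initialized for the sketch.
W_embed = rng.normal(size=(VOCAB, EMBED))
W_init  = rng.normal(size=(2 * ROI_DIM, HIDDEN))  # maps [box; image] feats -> h0
W_xh    = rng.normal(size=(EMBED, HIDDEN))
W_hh    = rng.normal(size=(HIDDEN, HIDDEN))
W_out   = rng.normal(size=(HIDDEN, VOCAB))

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def query_log_prob(query_ids, box_feat, img_feat):
    """Log p(query | box, image): a recurrent language model whose initial
    hidden state is conditioned on the concatenated local (ROI-pooled)
    and global (whole-image) visual features."""
    h = np.tanh(np.concatenate([box_feat, img_feat]) @ W_init)
    logp = 0.0
    for tok in query_ids:
        probs = softmax(h @ W_out)                    # next-token distribution
        logp += np.log(probs[tok] + 1e-12)            # accumulate query likelihood
        h = np.tanh(W_embed[tok] @ W_xh + h @ W_hh)   # recurrent state update
    return logp

# Score a few candidate boxes (stand-ins for RPN proposals) and pick the
# one under which the referring expression is most probable.
query = [3, 1, 4]                                     # token ids of the query
img_feat = rng.normal(size=ROI_DIM)                   # whole-image descriptor
boxes = [rng.normal(size=ROI_DIM) for _ in range(5)]  # ROI-pooled box features

scores = [query_log_prob(query, b, img_feat) for b in boxes]
best = int(np.argmax(scores))
print("selected box:", best)
```

In the full model, the same recurrence would be an LSTM, the box features would come from ROI pooling over RPN proposals, and the parameters would be trained end-to-end rather than drawn at random.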