Gated Bi-directional CNN for Object Detection

机译：用于物体检测的门控双向CNN

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The visual cues from multiple support regions of different sizes and resolutions are complementary in classifying a candidate box in object detection. How to effectively integrate local and contextual visual cues from these regions has become a fundamental problem in object detection. Most existing works simply concatenated features or scores obtained from support regions. In this paper, we proposal a novel gated bi-directional CNN (GBD-Net) to pass messages between features from different support regions during both feature learning and feature extraction. Such message passing can be implemented through convolution in two directions and can be conducted in various layers. Therefore, local and contextual visual patterns can validate the existence of each other by learning their nonlinear relationships and their close iterations are modeled in a much more complex way. It is also shown that message passing is not always helpful depending on individual samples. Gated functions are further introduced to control message transmission and their on-and-off is controlled by extra visual evidence from the input sample. GBD-Net is implemented under the Fast RCNN detection framework. Its effectiveness is shown through experiments on three object detection datasets, ImageNet, Pascal VOC2007 and Microsoft COCO.

机译：来自不同尺寸和分辨率的多个支持区域的视觉提示在分类对象检测中的候选盒中是互补的。如何从这些区域有效地整合本地和上下文视觉线索已成为对象检测中的根本问题。大多数现有的作品只是从支持区域获得的连接功能或分数。在本文中，我们提出了一种新型门控双向CNN（GBD-Net），以在特征学习和特征提取期间通过不同支撑区域的特征之间的消息。可以通过两个方向上的卷积来实现这样的消息传递，并且可以在各种层中进行。因此，本地和上下文的视觉模式可以通过学习其非线性关系来验证彼此的存在，并且它们的密切迭代以更复杂的方式建模。还表明，根据单个样本，消息传递并不总是有用。进一步引入门控功能以控制消息传输，并通过从输入样本的额外视觉证据控制它们的开关。 GBD-Net是在快速RCNN检测框架下实施的。通过对三个物体检测数据集，Imagenet，Pascal VOC2007和Microsoft Coco的实验显示其有效性。

著录项

来源
《European Conference on Computer Vision》|2016年|869p|共16页
会议地点
作者
Xingyu Zeng; Wanli Ouyang; Bin Yang; Junjie Yan; Xiaogang Wang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.41-53;
关键词
入库时间 2022-08-20 20:08:25

相似文献

外文文献
中文文献
专利

1. Gated CNN: Integrating multi-scale feature layers for object detection [J] . Yuan Jin, Xiong Heng-Chang, Xiao Yi, Pattern Recognition: The Journal of the Pattern Recognition Society . 2020,第期

机译：门控CNN：集成多尺度特征层进行对象检测
2. Combining Faster R-CNN and Model-Driven Clustering for Elongated Object Detection [J] . Fang Fen, Li Liyuan, Zhu Hongyuan, IEEE Transactions on Image Processing . 2020,第期

机译：组合更快的R-CNN和模型驱动聚类，用于细长物体检测
3. ME R-CNN: Multi-Expert R-CNN for Object Detection [J] . Lee Hyungtae, Eum Sungmin, Kwon Heesung IEEE Transactions on Image Processing . 2020,第期

机译：ME R-CNN：用于对象检测的多专家R-CNN
4. Gated Bi-directional CNN for Object Detection [C] . Xingyu Zeng, Wanli Ouyang, Bin Yang, European conference on computer vision . 2016

机译：门控双向CNN用于物体检测
5. Multi-Source UAV-Based Object Classification Using CNN's and Data Acquisition System for Robotic Skin [D] . Raghavendra Sriram, M.S. 2017

机译：基于CNN和数据采集系统的机器人皮肤基于多源无人机的目标分类
6. Image Captioning Using Motion-CNN with Object Detection [O] . Kiyohiko Iwamura, Jun Younes Louhi Kasahara, Alessandro Moro, 2021

机译：使用具有对象检测的Motion-CNN的图像标题
7. Concurrent Segmentation and Object Detection CNNs for Aircraft Detection and Identification in Satellite Images [O] . Damien Grosgeorge, Maxime Arbelot, Alex Goupilleau, 2020

机译：用于飞机检测和卫星图像识别的并发分段和对象检测CNN

Gated Bi-directional CNN for Object Detection

摘要

著录项

相似文献

相关主题

期刊订阅