首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >Multi-scale Location-Aware Kernel Representation for Object Detection
【24h】

Multi-scale Location-Aware Kernel Representation for Object Detection

机译:用于目标检测的多尺度位置感知内核表示

获取原文

摘要

Although Faster R-CNN and its variants have shown promising performance in object detection, they only exploit simple first-order representation of object proposals for final classification and regression. Recent classification methods demonstrate that the integration of high-order statistics into deep convolutional neural networks can achieve impressive improvement, but their goal is to model whole images by discarding location information so that they cannot be directly adopted to object detection. In this paper, we make an attempt to exploit high-order statistics in object detection, aiming at generating more discriminative representations for proposals to enhance the performance of detectors. To this end, we propose a novel Multi-scale Location-aware Kernel Representation (MLKP) to capture high-order statistics of deep features in proposals. Our MLKP can be efficiently computed on a modified multi-scale feature map using a low-dimensional polynomial kernel approximation. Moreover, different from existing orderless global representations based on high-order statistics, our proposed MLKP is location retentive and sensitive so that it can be flexibly adopted to object detection. Through integrating into Faster R-CNN schema, the proposed MLKP achieves very competitive performance with state-of-the-art methods, and improves Faster R-CNN by 4.9% (mAP), 4.7% (mAP) and 5.0% (AP at IOU=[0.5:0.05:0.95]) on PASCAL VOC 2007, VOC 2012 and MS COCO benchmarks, respectively. Code is available at: https://github.com/Hwang64/MLKP.
机译:尽管Faster R-CNN及其变体在对象检测中显示出令人鼓舞的性能,但它们仅利用对象建议的简单一阶表示形式进行最终分类和回归。最近的分类方法表明,将高阶统计量集成到深度卷积神经网络中可以实现令人印象深刻的改进,但是它们的目标是通过丢弃位置信息来对整个图像进行建模,从而使它们不能直接用于对象检测。在本文中,我们尝试在对象检测中利用高阶统计量,旨在为建议提供更具区分性的表示,以提高检测器的性能。为此,我们提出了一种新颖的多尺度位置感知内核表示(MLKP),以捕获提案中深层特征的高阶统计信息。使用低维多项式核逼近,可以在修改后的多尺度特征图上高效地计算我们的MLKP。而且,与现有的基于高阶统计量的无序全局表示法不同,我们提出的MLKP具有位置保持性和敏感性,因此可以灵活地应用于对象检测。通过集成到Faster R-CNN模式中,拟议的MLKP使用最新方法实现了非常有竞争力的性能,并将Faster R-CNN分别提高了4.9%(mAP),4.7%(mAP)和5.0%(AP分别在PASCAL VOC 2007,VOC 2012和MS COCO基准上的IOU = [0.5:0.05:0.95])。可以从以下网址获得代码:https://github.com/Hwang64/MLKP。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号