...
首页> 外文期刊>International Journal of Computer Vision >End-to-End Learning of Latent Deformable Part-Based Representations for Object Detection
【24h】

End-to-End Learning of Latent Deformable Part-Based Representations for Object Detection

机译:对象检测的基于潜在可变形的零件的表示的端到端学习

获取原文
获取原文并翻译 | 示例
           

摘要

Object detection methods usually represent objects through rectangular bounding boxes from which they extract features, regardless of their actual shapes. In this paper, we apply deformations to regions in order to learn representations better fitted to objects. We introduce DP-FCN, a deep model implementing this idea by learning to align parts to discriminative elements of objects in a latent way, i.e. without part annotation. This approach has two main assets: it builds invariance to local transformations, thus improving recognition, and brings geometric information to describe objects more finely, leading to a more accurate localization. We further develop both features in a new model named DP-FCN2.0 by explicitly learning interactions between parts. Alignment is done with an in-network joint optimization of all parts based on a CRF with custom potentials, and deformations are influencing localization through a bilinear product. We validate our models on PASCAL VOC and MS COCO datasets and show significant gains. DP-FCN2.0 achieves state-of-the-art results of 83.3 and 81.2% on VOC 2007 and 2012 with VOC data only.
机译:对象检测方法通常表示通过矩形边界框的对象,无论其实际形状如何。在本文中,我们将变形应用于地区,以便学习更好地适合物体的表示。我们介绍了DP-FCN,这是一种深入的模型,通过学习实现这个想法,以以潜在的方式对准物体的鉴别元素,即没有部分注释。这种方法有两个主要资产:它建立了与本地转换的不变性,从而提高了识别,并带来了更精细地描述对象的几何信息,导致了更准确的本地化。我们通过在零件之间显式学习交互,在名为DP-FCN2.0的新模型中进一步开发了两个功能。通过基于具有定制电位的CRF的所有部件的网络联合优化进行对准,并且变形正在影响通过双线性产品的定位。我们在Pascal VOC和MS Coco Datasets上验证我们的模型,并显示出显着的收益。 DP-FCN2.0仅在VOC 2007和2012上实现最新的结果83.3和81.2%,仅限VOC数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号