End-to-End Learning of Latent Deformable Part-Based Representations for Object Detection

Mordan Taylor; Thome Nicolas; Henaff Gilles; Cord Matthieu

首页> 外文期刊>International Journal of Computer Vision >End-to-End Learning of Latent Deformable Part-Based Representations for Object Detection

【24h】

End-to-End Learning of Latent Deformable Part-Based Representations for Object Detection

机译：对象检测的基于潜在可变形的零件的表示的端到端学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Object detection methods usually represent objects through rectangular bounding boxes from which they extract features, regardless of their actual shapes. In this paper, we apply deformations to regions in order to learn representations better fitted to objects. We introduce DP-FCN, a deep model implementing this idea by learning to align parts to discriminative elements of objects in a latent way, i.e. without part annotation. This approach has two main assets: it builds invariance to local transformations, thus improving recognition, and brings geometric information to describe objects more finely, leading to a more accurate localization. We further develop both features in a new model named DP-FCN2.0 by explicitly learning interactions between parts. Alignment is done with an in-network joint optimization of all parts based on a CRF with custom potentials, and deformations are influencing localization through a bilinear product. We validate our models on PASCAL VOC and MS COCO datasets and show significant gains. DP-FCN2.0 achieves state-of-the-art results of 83.3 and 81.2% on VOC 2007 and 2012 with VOC data only.

机译：对象检测方法通常表示通过矩形边界框的对象，无论其实际形状如何。在本文中，我们将变形应用于地区，以便学习更好地适合物体的表示。我们介绍了DP-FCN，这是一种深入的模型，通过学习实现这个想法，以以潜在的方式对准物体的鉴别元素，即没有部分注释。这种方法有两个主要资产：它建立了与本地转换的不变性，从而提高了识别，并带来了更精细地描述对象的几何信息，导致了更准确的本地化。我们通过在零件之间显式学习交互，在名为DP-FCN2.0的新模型中进一步开发了两个功能。通过基于具有定制电位的CRF的所有部件的网络联合优化进行对准，并且变形正在影响通过双线性产品的定位。我们在Pascal VOC和MS Coco Datasets上验证我们的模型，并显示出显着的收益。 DP-FCN2.0仅在VOC 2007和2012上实现最新的结果83.3和81.2％，仅限VOC数据。

著录项

来源
《International Journal of Computer Vision》 |2019年第12期|共21页
作者
Mordan Taylor; Thome Nicolas; Henaff Gilles; Cord Matthieu;
展开▼
作者单位

Sorbonne Univ CNRS Lab Informat Paris 6 LIP6 F-75005 Paris France;

Conservatoire Natl Arts &

Metiers CEDRIC 292 Rue St Martin F-75003 Paris France;

Thales Land &

Air Syst 2 Ave Gay Lussac F-78990 Elancourt France;

Sorbonne Univ CNRS Lab Informat Paris 6 LIP6 F-75005 Paris France;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Object detection; Fully convolutional network; Deep learning; Part-based representation; End-to-end latent part learning;

机译：对象检测;完全卷积网络;深入学习;基于部分的代表;结束潜在学习;

相似文献

外文文献
中文文献
专利

1. End-to-End Learning of Latent Deformable Part-Based Representations for Object Detection [J] . Mordan Taylor, Thome Nicolas, Henaff Gilles, International Journal of Computer Vision . 2019,第11a12期

机译：对象检测的基于潜在可变形的零件的表示的端到端学习
2. Part-based deformable object detection with a single sketch [J] . Sreyasee Das Bhattacharjee, Anurag Mittal Computer vision and image understanding . 2015,第octa期

机译：单一草图的基于零件的可变形对象检测
3. Deformable Part-Based Model Transfer for Object Detection [J] . Zhiwei RUAN, Guijin WANG, Xinggang LIN, IEICE transactions on information and systems . 2014,第5期

机译：用于物体检测的可变形零件的模型传输
4. Fusing generic objectness and deformable part-based models for weakly supervised object detection [C] . Yuxing Tang, Xiaofang Wang, Dellandrea Emmanuel, IEEE International Conference on Image Processing . 2014

机译：融合通用对象和基于零件的可变形模型以进行弱监督的对象检测
5. Shape Perception as Bayesian Inference of Modality-Independent Part-Based 3D Object-Centered Shape Representations [D] . Erdogan, Goker. 2017

机译：形状感知作为基于模态的基于零件的3D对象中心形状表示的贝叶斯推理
6. Reinforced AdaBoost Learning for Object Detection with Local Pattern Representations [O] . Younghyun Lee, David K. Han, Hanseok Ko 2013

机译：增强的AdaBoost学习用于使用局部模式表示进行对象检测
7. Weakly Supervised Learning of Deformable Part-Based Models for Object Detection via Region Proposals [O] . Yuxing Tang, Xiaofang Wang, Emmanuel Dellandrea, 2017

机译：通过区域建议对物体检测的可变形零件模型进行弱监督学习

End-to-End Learning of Latent Deformable Part-Based Representations for Object Detection

摘要

著录项

相似文献

相关主题

期刊订阅