Learning a Hierarchical Deformable Template for Rapid Deformable Object Parsing

首页> 外文期刊>Pattern Analysis and Machine Intelligence, IEEE Transactions on >Learning a Hierarchical Deformable Template for Rapid Deformable Object Parsing

【24h】

Learning a Hierarchical Deformable Template for Rapid Deformable Object Parsing

机译：学习用于快速可变形对象解析的分层可变形模板

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we address the tasks of detecting, segmenting, parsing, and matching deformable objects. We use a novel probabilistic object model that we call a hierarchical deformable template (HDT). The HDT represents the object by state variables defined over a hierarchy (with typically five levels). The hierarchy is built recursively by composing elementary structures to form more complex structures. A probability distribution-a parameterized exponential model-is defined over the hierarchy to quantify the variability in shape and appearance of the object at multiple scales. To perform inference-to estimate the most probable states of the hierarchy for an input image-we use a bottom-up algorithm called compositional inference. This algorithm is an approximate version of dynamic programming where approximations are made (e.g., pruning) to ensure that the algorithm is fast while maintaining high performance. We adapt the structure-perceptron algorithm to estimate the parameters of the HDT in a discriminative manner (simultaneously estimating the appearance and shape parameters). More precisely, we specify an exponential distribution for the HDT using a dictionary of potentials, which capture the appearance and shape cues. This dictionary can be large and so does not require handcrafting the potentials. Instead, structure-perceptron assigns weights to the potentials so that less important potentials receive small weights (this is like a ?soft? form of feature selection). Finally, we provide experimental evaluation of HDTs on different visual tasks, including detection, segmentation, matching (alignment), and parsing. We show that HDTs achieve state-of-the-art performance for these different tasks when evaluated on data sets with groundtruth (and when compared to alternative algorithms, which are typically specialized to each task).

机译：在本文中，我们解决了检测，分割，解析和匹配可变形对象的任务。我们使用一种称为概率可变形模板（HDT）的新型概率对象模型。 HDT通过在层次结构（通常具有五个级别）上定义的状态变量来表示对象。通过组合基本结构以形成更复杂的结构来递归构建层次结构。在层次结构上定义了概率分布（一种参数化的指数模型），以量化对象在多个尺度上的形状和外观的变化。为了进行推理（估计输入图像的层次结构最可能的状态），我们使用了一种自下而上的算法，称为合成推理。该算法是动态编程的近似版本，在其中进行近似（例如修剪）以确保算法在保持高性能的同时快速运行。我们采用结构感知器算法来区别地估计HDT的参数（同时估计外观和形状参数）。更准确地说，我们使用势能字典指定HDT的指数分布，该势能字典捕获外观和形状提示。该词典可能很大，因此不需要手工制作电位。取而代之的是，结构感知器将权重分配给电势，以便不太重要的电势获得较小的权重（这就像特征选择的“软”形式）。最后，我们提供了HDT在不同视觉任务上的实验评估，包括检测，分割，匹配（对齐）和解析。我们证明，当对具有groundtruth的数据集进行评估时（以及与通常专用于每个任务的替代算法进行比较时），HDT可以为这些不同的任务实现最先进的性能。

著录项

来源
《Pattern Analysis and Machine Intelligence, IEEE Transactions on》 |2010年第6期|p.1029-1043|共15页
作者

展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Hierarchy; object parsing; segmentation; shape matching; shape representation; structured learning.;

机译：层次结构;对象解析;分段;形状匹配;形状表示;结构化学习。;

相似文献

外文文献
中文文献
专利

1. Max margin learning of hierarchical configural deformable templates (HCDTs) for efficient object parsing and pose estimation [J] . Zhu L., Chen Y., Lin C., International Journal of Computer Vision . 2011,第1期

机译：分层结构可变形模板（HCDT）的最大余量学习，可进行有效的对象解析和姿势估计
2. Max Margin Learning of Hierarchical Configural Deformable Templates (HCDTs) for Efficient Object Parsing and Pose Estimation [J] . Long (Leo) Zhu, Yuanhao Chen, Chenxi Lin, International Journal of Computer Vision . 2011,第1期

机译：分层结构可变形模板（HCDT）的最大边距学习，可实现有效的对象解析和姿势估计
3. Constellational contour parsing for deformable object detection [J] . Huang Chen, Han Tony X., He Zhihai, Journal of visual communication & image representation . 2016,第jula期

机译：星座轮廓解析用于可变形物体检测
4. Online Object Tracking and Learning with Sparse Deformable Template Models [C] . Bowen Shi, Tianzhe Fan, Qun Liu International Symposium on Computational Intelligence and Design . 2015

机译：用稀疏可变形模板型号的在线对象跟踪和学习
5. Deformable Object Manipulation: Learning While Doing [D] . McConachie, Dale. 2020

机译：可变形的物体操纵：在做时学习
6. Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects [O] . Tristan J. Webb, Edmund T. Rolls 2014

机译：特定于变形和不变变形的视觉对象识别：人物和变形对象的姿势与身份识别
7. Learning a Hierarchical Deformable Template for Rapid Deformable Object Parsing [O] . Long, Zhu, Chen, Yuanhao, Yuille, Alan 2010

机译：学习用于快速可变形对象分析的分层可变形模板

Learning a Hierarchical Deformable Template for Rapid Deformable Object Parsing

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅