首页> 外文期刊>Pattern Analysis and Machine Intelligence, IEEE Transactions on >Learning a Hierarchical Deformable Template for Rapid Deformable Object Parsing
【24h】

Learning a Hierarchical Deformable Template for Rapid Deformable Object Parsing

机译:学习用于快速可变形对象解析的分层可变形模板

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper, we address the tasks of detecting, segmenting, parsing, and matching deformable objects. We use a novel probabilistic object model that we call a hierarchical deformable template (HDT). The HDT represents the object by state variables defined over a hierarchy (with typically five levels). The hierarchy is built recursively by composing elementary structures to form more complex structures. A probability distribution-a parameterized exponential model-is defined over the hierarchy to quantify the variability in shape and appearance of the object at multiple scales. To perform inference-to estimate the most probable states of the hierarchy for an input image-we use a bottom-up algorithm called compositional inference. This algorithm is an approximate version of dynamic programming where approximations are made (e.g., pruning) to ensure that the algorithm is fast while maintaining high performance. We adapt the structure-perceptron algorithm to estimate the parameters of the HDT in a discriminative manner (simultaneously estimating the appearance and shape parameters). More precisely, we specify an exponential distribution for the HDT using a dictionary of potentials, which capture the appearance and shape cues. This dictionary can be large and so does not require handcrafting the potentials. Instead, structure-perceptron assigns weights to the potentials so that less important potentials receive small weights (this is like a ?soft? form of feature selection). Finally, we provide experimental evaluation of HDTs on different visual tasks, including detection, segmentation, matching (alignment), and parsing. We show that HDTs achieve state-of-the-art performance for these different tasks when evaluated on data sets with groundtruth (and when compared to alternative algorithms, which are typically specialized to each task).
机译:在本文中,我们解决了检测,分割,解析和匹配可变形对象的任务。我们使用一种称为概率可变形模板(HDT)的新型概率对象模型。 HDT通过在层次结构(通常具有五个级别)上定义的状态变量来表示对象。通过组合基本结构以形成更复杂的结构来递归构建层次结构。在层次结构上定义了概率分布(一种参数化的指数模型),以量化对象在多个尺度上的形状和外观的变化。为了进行推理(估计输入图像的层次结构最可能的状态),我们使用了一种自下而上的算法,称为合成推理。该算法是动态编程的近似版本,在其中进行近似(例如修剪)以确保算法在保持高性能的同时快速运行。我们采用结构感知器算法来区别地估计HDT的参数(同时估计外观和形状参数)。更准确地说,我们使用势能字典指定HDT的指数分布,该势能字典捕获外观和形状提示。该词典可能很大,因此不需要手工制作电位。取而代之的是,结构感知器将权重分配给电势,以便不太重要的电势获得较小的权重(这就像特征选择的“软”形式)。最后,我们提供了HDT在不同视觉任务上的实验评估,包括检测,分割,匹配(对齐)和解析。我们证明,当对具有groundtruth的数据集进行评估时(以及与通常专用于每个任务的替代算法进行比较时),HDT可以为这些不同的任务实现最先进的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号