首页> 外文期刊>Pattern Analysis and Machine Intelligence, IEEE Transactions on >Parsing Based on Parselets: A Unified Deformable Mixture Model for Human Parsing
【24h】

Parsing Based on Parselets: A Unified Deformable Mixture Model for Human Parsing

机译:基于Parselets的解析:用于人解析的统一可变形混合模型

获取原文
获取原文并翻译 | 示例
           

摘要

Human parsing, namely partitioning the human body into semantic regions, has drawn much attention recently for its wide applications in human-centric analysis. Previous works often consider solving the problem of human pose estimation as the prerequisite of human parsing. We argue that these approaches cannot obtain optimal pixel-level parsing due to the inconsistent targets between the different tasks. In this work, we directly address the problem of human parsing by using the novel Parselet representation as the building blocks of our parsing model. Parselets are a group of parsable segments which can generally be obtained by low-level over-segmentation algorithms and bear strong semantic meaning. We then build a deformable mixture parsing model (DMPM) for human parsing to simultaneously handle the deformation and multi-modalities of Parselets. The proposed model has two unique characteristics: (1) the possible numerous modalities of Parselet ensembles are exhibited as the “And-Or” structure of sub-trees; (2) to further solve the practical problem of Parselet occlusion or absence, we directly model the visibility property at some leaf nodes. The DMPM thus directly solves the problem of human parsing by searching for the best graph configuration from a pool of Parselet hypotheses without intermediate tasks. Fast rejection based on hierarchical filtering is employed to ensure the overall efficiency. Comprehensive evaluations on a new large-scale human parsing dataset, which is crawled from the Internet, with high resolution and thoroughly annotated semantic labels at pixel-level, and also a benchmark dataset demonstrate the encouraging performance of the proposed approach.
机译:人体分析,即将人体划分为语义区域,由于其在以人为中心的分析中的广泛应用,最近引起了很多关注。先前的作品通常认为解决人体姿势估计问题是人体分析的前提。我们认为,由于不同任务之间的目标不一致,因此这些方法无法获得最佳的像素级解析。在这项工作中,我们通过使用新颖的Parselet表示作为我们的解析模型的构建块,直接解决了人类解析的问题。 Parselet是一组可解析的段,通常可以通过低级过分分割算法获得这些段,并具有很强的语义含义。然后,我们建立一个可变形的混合解析模型(DMPM)进行人类解析,以同时处理Parselets的变形和多模式。所提出的模型具有两个独特的特征:(1)Parselet合奏的可能多种形式表现为子树的“与或”结构; (2)为了进一步解决Parselet闭塞或缺失的实际问题,我们直接对某些叶节点的可见性进行建模。因此,DMPM通过从Parselet假设池中搜索最佳图形配置来直接解决人工分析问题,而无需中间任务。基于分层过滤的快速拒绝被用来确保整体效率。对从互联网上爬出的具有高分辨率和像素级语义标签的完整注释的新的大规模人类解析数据集的综合评估以及基准数据集证明了该方法的令人鼓舞的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号