首页> 外文会议>European conference on computer vision >Instance-Level Human Parsing via Part Grouping Network
【24h】

Instance-Level Human Parsing via Part Grouping Network

机译:通过零件分组网络进行实例级人员解析

获取原文

摘要

Instance-level human parsing towards real-world human analysis scenarios is still under-explored due to the absence of sufficient data resources and technical difficulty in parsing multiple instances in a single pass. Several related works all follow the "parsing-by-detection" pipeline that heavily relies on separately trained detection models to localize instances and then performs human parsing for each instance sequentially. Nonetheless, two discrepant optimization targets of detection and parsing lead to suboptimal representation learning and error accumulation for final results, fn this work, we make the first attempt to explore a detection-free Part Grouping Network (PGN) for efficiently parsing multiple people in an image in a single pass. Our PGN reformulates instance-level human parsing as two twinned sub-tasks that can be jointly learned and mutually refined via a unified network: (1) semantic part segmentation for assigning each pixel as a human part (e.g., face, arms); (2) instance-aware edge detection to group semantic parts into distinct person instances. Thus the shared intermediate representation would be endowed with capabilities in both characterizing finegrained parts and inferring instance belongings of each part. Finally, a simple instance partition process is employed to get final results during inference. We conducted experiments on PASCAL-Person-Part dataset and our PGN outperforms all state-of-the-art methods. Furthermore, we show its superiority on a newly collected multi-person parsing dataset (CIHP) including 38,280 diverse images, which is the largest dataset so far and can facilitate more advanced human analysis. The CIHP benchmark and our source code are available at http://sysu-hcp.net/lip/.
机译:由于缺乏足够的数据资源和单次解析多个实例的技术难度,针对实际人类分析场景的实例级人类解析仍未得到充分研究。几项相关的工作都遵循“按检测解析”管道,该流程严重依赖单独训练的检测模型来定位实例,然后依次对每个实例执行人工解析。但是,检测和解析的两个不同的优化目标导致次优的表示学习和错误累积,以获得最终结果。在这项工作中,我们首次尝试探索一种免检测的零件分组网络(PGN),以有效地解析一个人中的多个人。一次通过图像。我们的PGN将实例级人员解析重新构造为两个孪生子任务,可以通过一个统一网络共同学习和相互完善:(1)语义部分分割,用于将每个像素分配为人员部分(例如,面部,手臂); (2)实例感知边缘检测可将语义部分分为不同的人实例。因此,共享的中间表示将具有表征细粒度部分和推断每个部分的实例所有物的能力。最后,采用一个简单的实例分区过程来获得推理过程中的最终结果。我们在PASCAL-Person-Part数据集上进行了实验,我们的PGN优于所有最新方法。此外,我们在包括38,280张不同图像的新收集的多人解析数据集(CIHP)上显示了其优越性,这是迄今为止最大的数据集,可以促进更高级的人类分析。 CIHP基准测试和我们的源代码可从http://sysu-hcp.net/lip/获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号