...
首页> 外文期刊>Pattern Analysis and Machine Intelligence, IEEE Transactions on >Parameter Estimation and Energy Minimization for Region-Based Semantic Segmentation
【24h】

Parameter Estimation and Energy Minimization for Region-Based Semantic Segmentation

机译:基于区域的语义分割的参数估计和能量最小化

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We consider the problem of parameter estimation and energy minimization for a region-based semantic segmentation model. The model divides the pixels of an image into non-overlapping connected regions, each of which is to a semantic class. In the context of energy minimization, the main problem we face is the large number of putative pixel-to-region assignments. We address this problem by designing an accurate linear programming based approach for selecting the best set of regions from a large dictionary. The dictionary is constructed by merging and intersecting segments obtained from multiple bottom-up over-segmentations. The linear program is solved efficiently using dual decomposition. In the context of parameter estimation, the main problem we face is the lack of fully supervised data. We address this issue by developing a principled framework for parameter estimation using diverse data. More precisely, we propose a latent structural support vector machine formulation, where the latent variables model any missing information in the human annotation. Of particular interest to us are three types of annotations: (i) images segmented using generic foreground or background classes; (ii) images with bounding boxes specified for objects; and (iii) images labeled to indicate the presence of a class. Using large, publicly available datasets we show that our methods are able to significantly improve the accuracy of the region-based model.
机译:我们考虑了基于区域的语义分割模型的参数估计和能量最小化的问题。该模型将图像的像素划分为非重叠的连接区域,每个区域都属于一个语义类。在能量最小化的背景下,我们面临的主要问题是大量假定的像素到区域分配。我们通过设计一种基于精确线性规划的方法来解决此问题,该方法用于从大型词典中选择最佳区域集。该字典是通过合并和相交从多个自下而上的过度细分中获得的细分构成的。使用对偶分解可以有效地求解线性程序。在参数估计的背景下,我们面临的主要问题是缺乏完全监督的数据。我们通过开发用于使用各种数据进行参数估计的原则性框架来解决此问题。更准确地说,我们提出了一种潜在的结构支持向量机公式,其中潜在变量对人类注释中的任何缺失信息进行建模。我们特别感兴趣的是三种类型的注释:(i)使用通用前景或背景类别进行分割的图像; (ii)具有为对象指定的边框的图像; (iii)标记为表明存在某类的图像。通过使用大型的公共可用数据集,我们证明了我们的方法能够显着提高基于区域的模型的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号