首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Tell Me What You See and I Will Show You Where It Is
【24h】

Tell Me What You See and I Will Show You Where It Is

机译:告诉我你所看到的,我会告诉你它在哪里

获取原文

摘要

We tackle the problem of weakly labeled semantic segmentation, where the only source of annotation are image tags encoding which classes are present in the scene. This is an extremely difficult problem as no pixel-wise labelings are available, not even at training time. In this paper, we show that this problem can be formalized as an instance of learning in a latent structured prediction framework, where the graphical model encodes the presence and absence of a class as well as the assignments of semantic labels to superpixels. As a consequence, we are able to leverage standard algorithms with good theoretical properties. We demonstrate the effectiveness of our approach using the challenging SIFT-flow dataset and show average per-class accuracy improvements of 7% over the state-of-the-art.
机译:我们解决了弱标记语义分割的问题,其中唯一的注释来源是图像标签,用于编码场景中存在的类。这是一个非常困难的问题,因为甚至在训练时也没有可用的按像素标记。在本文中,我们证明了该问题可以形式化为潜在的结构化预测框架中的学习实例,其中的图形模型编码一类的存在与否以及语义标签对超像素的分配。因此,我们能够利用具有良好理论特性的标准算法。我们使用具有挑战性的SIFT流数据集证明了我们的方法的有效性,并且显示出与最新技术相比,每类平均准确性提高了7%。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号