首页> 外文会议>International Conference on Multimedia Modeling >A New Local Transformation Module for Few-Shot Segmentation
【24h】

A New Local Transformation Module for Few-Shot Segmentation

机译:用于少量分割的新局部转换模块

获取原文

摘要

Few-shot segmentation segments object regions of new classes with a few of manual annotations. Its key step is to establish the transformation module between support images (annotated images) and query images (unlabeled images), so that the segmentation cues of support images can guide the segmentation of query images. The existing methods form transformation model based on global cues, which however ignores the local cues that are verified in this paper to be very important for the transformation. This paper proposes a new transformation module based on local cues, where the relationship of the local features is used for transformation. To enhance the generalization performance of the network, the relationship matrix is calculated in a high-dimensional metric embedding space based on cosine distance. In addition, to handle the challenging mapping problem from the low-level local relationships to high-level semantic cues, we propose to apply generalized inverse matrix of the annotation matrix of support images to transform the relationship matrix linearly, which is non-parametric and class-agnostic. The result by the matrix transformation can be regarded as an attention map with high-level semantic cues, based on which a transformation module can be built simply. The proposed transformation module is a general module that can be used to replace the transformation module in the existing few-shot segmentation frameworks. We verify the effectiveness of the proposed method on Pascal VOC 2012 dataset. The value of mIoU achieves at 57.0% in 1-shot and 60.6% in 5-shot, which outperforms the state-of-the-art method by 1.6% and 3.5%, respectively.
机译:很少有的分割方法会通过一些手动注释来分割新类的对象区域。它的关键步骤是建立支持图像(带注释的图像)和查询图像(未标记的图像)之间的转换模块,以便支持图像的分割线索可以指导查询图像的分割。现有方法形成了基于全局提示的转换模型,但是忽略了本文证明的对于转换非常重要的局部提示。本文提出了一种基于局部线索的新的变换模块,其中利用局部特征的关系进行变换。为了提高网络的泛化性能,基于余弦距离在高维度量嵌入空间中计算关系矩阵。此外,为了处理从低级本地关系到高级语义线索的挑战性映射问题,我们建议应用支持图像注释矩阵的广义逆矩阵对关系矩阵进行线性变换,该关系矩阵是非参数的,并且类不可知的。矩阵转换的结果可以看作是具有高级语义提示的注意图,基于此可以简单地构建转换模块。提议的转换模块是一个通用模块,可用于替换现有的少量快照分割框架中的转换模块。我们在Pascal VOC 2012数据集上验证了该方法的有效性。 1次击打时的mIoU值达到57.0%,5次击打时达到60.6%,分别比最新方法高1.6%和3.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号