【24h】

Deep Interactive Object Selection

机译:深度互动对象选择

获取原文

摘要

Interactive object selection is a very important research problem and has many applications. Previous algorithms require substantial user interactions to estimate the foreground and background distributions. In this paper, we present a novel deep-learning-based algorithm which has much better understanding of objectness and can reduce user interactions to just a few clicks. Our algorithm transforms user-provided positive and negative clicks into two Euclidean distance maps which are then concatenated with the RGB channels of images to compose (image, user interactions) pairs. We generate many of such pairs by combining several random sampling strategies to model users' click patterns and use them to finetune deep Fully Convolutional Networks (FCNs). Finally the output probability maps of our FCN-8s model is integrated with graph cut optimization to refine the boundary segments. Our model is trained on the PASCAL segmentation dataset and evaluated on other datasets with different object classes. Experimental results on both seen and unseen objects demonstrate that our algorithm has a good generalization ability and is superior to all existing interactive object selection approaches.
机译:交互式对象选择是一个非常重要的研究问题,具有许多应用。先前的算法需要大量的用户交互来估计前景和背景分布。在本文中,我们提出了一种新颖的基于深度学习的算法,该算法可以更好地了解对象,并且可以将用户交互减少到仅需单击几下即可。我们的算法将用户提供的正点击和负点击转换为两个欧几里得距离图,然后将其与图像的RGB通道连接起来以构成(图像,用户交互)对。我们通过组合几种随机采样策略来建模用户的点击模式,并使用它们来微调深层的全卷积网络(FCN),从而生成许多这样的对。最后,将我们的FCN-8s模型的输出概率图与图割优化集成在一起,以细化边界段。我们的模型是在PASCAL细分数据集上训练的,并在具有不同对象类别的其他数据集上进行了评估。对可见和不可见对象的实验结果表明,我们的算法具有良好的泛化能力,优于所有现有的交互式对象选择方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号