...
首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Robust Deep Co-Saliency Detection With Group Semantic and Pyramid Attention
【24h】

Robust Deep Co-Saliency Detection With Group Semantic and Pyramid Attention

机译:群体语义和金字塔关注的强大的深度合作检测

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

High-level semantic knowledge in addition to low-level visual cues is essentially crucial for co-saliency detection. This article proposes a novel end-to-end deep learning approach for robust co-saliency detection by simultaneously learning high-level groupwise semantic representation as well as deep visual features of a given image group. The interimage interaction at the semantic level and the complementarity between the group semantics and visual features are exploited to boost the inferring capability of co-salient regions. Specifically, the proposed approach consists of a co-category learning branch and a co-saliency detection branch. While the former is proposed to learn a groupwise semantic vector using co-category association of an image group as supervision, the latter is to infer precise co-salient maps based on the ensemble of group-semantic knowledge and deep visual cues. The group-semantic vector is used to augment visual features at multiple scales and acts as a top-down semantic guidance for boosting the bottom-up inference of co-saliency. Moreover, we develop a pyramidal attention (PA) module that endows the network with the capability of concentrating on important image patches and suppressing distractions. The co-category learning and co-saliency detection branches are jointly optimized in a multitask learning manner, further improving the robustness of the approach. We construct a new large-scale co-saliency data set COCO-SEG to facilitate research of the co-saliency detection. Extensive experimental results on COCO-SEG and a widely used benchmark Cosal2015 have demonstrated the superiority of the proposed approach compared with state-of-the-art methods.
机译:高级语义知识除了低级别的视觉提示外,对于合理检测至关重要。本文通过同时学习高级集团语义表示以及给定图像组的深度可视特征,提出了一种新的端到端深度学习方法,可通过同时学习高级集团语义表示以及给定图像组的深度视觉特征。利用语义级别的相互区交互和组语义和视觉特征之间的互补性以提高共缘区域的推断能力。具体地,所提出的方法包括共同类别学习分支和共同显着性检测分支。虽然前者被提出使用图像组的共同类别协会作为监督的共同类别协会学习集团语义矢量,但后者是基于组语义知识和深视觉线索的集合推断精确的共同映射。组语义向量用于在多个尺度上增加视觉特征,并充当自上而下的语义指导,用于提高共同显着性的自下而上推断。此外,我们开发了一种金字塔关注(PA)模块,其赋予网络具有集中在重要的图像斑块和抑制分心的能力。共同类别的学习和共同显着性检测分支以多任务学习方式共同优化,进一步提高了方法的稳健性。我们构建了一个新的大型合理化数据集Coco-SEG,以促进对共同显着性检测的研究。 COCO-SEG和广泛使用的基准Cosal2015的广泛实验结果表明,与最先进的方法相比,所提出的方法的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号