首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation
【24h】

Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation

机译:联合引用表达理解和分段的多任务协作网络

获取原文
获取外文期刊封面目录资料

摘要

Referring expression comprehension (REC) and segmentation (RES) are two highly-related tasks, which both aim at identifying the referent according to a natural language expression. In this paper, we propose a novel Multi-task Collaborative Network (MCN) to achieve a joint learning of REC and RES for the first time. In MCN, RES can help REC to achieve better language-vision alignment, while REC can help RES to better locate the referent. In addition, we address a key challenge in this multi-task setup, i.e., the prediction conflict, with two innovative designs namely, Consistency Energy Maximization (CEM) and Adaptive Soft Non-Located Suppression (ASNLS). Specifically, CEM enables REC and RES to focus on similar visual regions by maximizing the consistency energy between two tasks. ASNLS supresses the response of unrelated regions in RES based on the prediction of REC. To validate our model, we conduct extensive experiments on three benchmark datasets of REC and RES, i.e., RefCOCO, RefCOCO+ and RefCOCOg. The experimental results report the significant performance gains of MCN over all existing methods, i.e., up to +7.13% for REC and +11.50% for RES over SOTA, which well confirm the validity of our model for joint REC and RES learning.
机译:引用表达理解(REC)和分段(RES)是两个高度相关的任务,它们都旨在根据自然语言表达来识别引用对象。在本文中,我们提出了一种新颖的多任务协作网络(MCN),以便首次实现REC和RES的联合学习。在MCN中,RES可以帮助REC实现更好的语言视觉对齐,而REC可以帮助RES更好地定位参考对象。另外,我们通过两种创新设计,即一致性能量最大化(CEM)和自适应软非定位抑制(ASNLS),解决了这种多任务设置中的关键挑战,即预测冲突。具体而言,CEM通过最大化两个任务之间的一致性能量,使REC和RES专注于相似的视觉区域。基于REC的预测,ASNLS抑制了RES中无关区域的响应。为了验证我们的模型,我们在REC和RES的三个基准数据集(即RefCOCO,RefCOCO +和RefCOCOg)上进行了广泛的实验。实验结果表明,与所有现有方法相比,MCN的性能都有显着提高,即REC的SON最高+ 7.13%,RES的SON最高+ 11.50%,这很好地证实了我们的REC和RES联合学习模型的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号