Conference: International Conference on Artificial Neural Networks

Referring Expression Comprehension via Co-attention and Visual Context

Abstract

Referring expression comprehension, an active research topic in multimodal media analysis, locates the object region in an image that a natural language expression refers to. Because localization accuracy among similar objects is often degraded by the presence or absence of supporting objects in the referring expression, we propose a referring expression comprehension method based on co-attention and visual context. When the referring expression lacks supporting objects, we use co-attention to strengthen the attention on attributes in the subject module. When supporting objects are present, we introduce visual context to explore the latent link between the candidate object and its supporting objects. Experiments on three datasets, RefCOCO, RefCOCO+, and RefCOCOg, show that our approach outperforms published approaches by a considerable margin.
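The abstract does not give the model's equations, so the following is only a generic illustration of what a co-attention step between an expression and a candidate region might look like, not the authors' architecture. The function names (`co_attention`, `match_score`), the scaled dot-product affinity, the max-pooling of the affinity matrix, and the cosine matching score are all assumptions chosen for brevity.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def co_attention(words, regions):
    """Generic parallel co-attention (hypothetical sketch).

    words:   (T, d) embeddings of the T words in the expression
    regions: (R, d) visual features of R grid cells of one candidate object
    """
    d = words.shape[1]
    # Affinity between every word and every visual cell.
    affinity = words @ regions.T / np.sqrt(d)        # (T, R)
    # Each word is weighted by its best-matching cell, and vice versa.
    word_weights = softmax(affinity.max(axis=1))     # (T,)
    region_weights = softmax(affinity.max(axis=0))   # (R,)
    attended_words = word_weights @ words            # (d,)
    attended_regions = region_weights @ regions      # (d,)
    return attended_words, attended_regions, word_weights, region_weights


def match_score(words, regions):
    # Cosine similarity between the two attended vectors (an assumption,
    # used here only to rank candidates).
    q, v, _, _ = co_attention(words, regions)
    return float(q @ v / (np.linalg.norm(q) * np.linalg.norm(v) + 1e-8))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    expression = rng.normal(size=(5, 8))                      # 5 words, d = 8
    candidates = [rng.normal(size=(7, 8)) for _ in range(3)]  # 3 candidate objects
    scores = [match_score(expression, c) for c in candidates]
    print("predicted candidate:", int(np.argmax(scores)))
```

In this sketch, words that match some visual cell strongly (for example attribute words such as a color) receive larger weights, which is one plausible way to read "enhancing the attention on attributes". The visual context component would additionally require features for neighboring objects, which the abstract does not detail.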
