Referring Expression Comprehension via Co-attention and Visual Context

机译：通过共同注意和视觉上下文引用表达理解

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

As a research hotspot of multimodal media analysis, referring expression comprehension locates the referred object region in an image by mapping a natural language. Though the localizing accuracy of similar objects is often distorted by the presence or absence of supporting objects in the referring expression, we propose a referring expression comprehension method via co-attention and visual context. For lacking supporting objects in referring expression, we propose co-attention to enhance the attention on attributes for the subject module. For existing supporting objects, we introduce visual context to explore the latent link between the candidate object and its supporters. Experiments on three datasets RefCOCO, RefCOCO+, and RefCOCOg, show that our approach outperforms published approaches by a considerable margin.

机译：作为多模式媒体分析的研究热点，参照表达理解通过映射自然语言来定位图像中的参照对象区域。尽管相似对象的定位精度通常会因引用表达中是否存在支持对象而失真，但我们还是通过共同注意和视觉上下文提出了一种表达表达理解方法。由于在引用表达时缺乏支持对象，我们建议共同注意以增强对主题模块属性的关注。对于现有的支持对象，我们引入视觉上下文来探索候选对象及其支持者之间的潜在链接。在三个数据集RefCOCO，RefCOCO +和RefCOCOg上进行的实验表明，我们的方法在很大程度上优于已发布的方法。

著录项

来源
《International Conference on Artificial Neural Networks》|2019年|119-130|共12页
会议地点
作者
Youming Gao; Yi Ji; Ting Xu; Yunlong Xu; Chunping Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Neural network; Co-attention; Visual context; Referring expression comprehension;

机译：神经网络;共同注意;视觉环境;引用表达理解;
入库时间 2022-08-26 13:53:48

相似文献

外文文献
中文文献
专利

1. Variational Context: Exploiting Visual and Textual Context for Grounding Referring Expressions [J] . Niu Yulei, Zhang Hanwang, Lu Zhiwu, IEEE Transactions on Pattern Analysis and Machine Intelligence . 2021,第1期

机译：变分语境：利用用于接地的视觉和文本上下文表达式
2. The use of visual context during the production of referring expressions [J] . Fukumura K., van Gompel R.P.G., Pickering M.J. The quarterly journal of experimental psychology: QJEP . 2010,第9期

机译：在生成引用表达时使用视觉环境
3. The use of visual context during the production of referring expressions [J] . Kumiko Fukumura Roger P. G. van Gompel Martin J. Pickering The Quarterly Journal of Experimental Psychology . 2010,第9期

机译：在产生参照表达时使用视觉环境
4. Referring Expression Comprehension via Co-attention and Visual Context [C] . Youming Gao, Yi Ji, Ting Xu, International Conference on Artificial Neural Networks . 2019

机译：通过共同关注和视觉上下文引用表达式理解
5. Referring Expression Comprehension for CLEVR-Ref+ Dataset [D] . Rathor, Kuldeep Singh. 2020

机译：引用CLEVR-REF + DataSet的表达式理解
6. How Visual Word Decoding and Context-Driven Auditory Semantic Integration Contribute to Reading Comprehension: A Test of Additive vs. Multiplicative Models [O] . Yu Li, Hongbing Xing, Linjun Zhang, 2021

机译：视觉词解码和上下文驱动的听觉语义集成有助于阅读理解：添加剂与乘法模型的测试
7. The use of visual context during the production of referring expressions [O] . Fukumura Kumiko, van-Gompel Roger P. G., Pickering Martin J. 2010

机译：在产生参照表达时使用视觉环境

Referring Expression Comprehension via Co-attention and Visual Context

摘要

著录项

相似文献

相关主题

期刊订阅