首页> 外文期刊>IEEE Transactions on Image Processing >Bi-Directional Spatial-Semantic Attention Networks for Image-Text Matching
【24h】

Bi-Directional Spatial-Semantic Attention Networks for Image-Text Matching

机译:图像文本匹配的双向空间语义注意网络

获取原文
获取原文并翻译 | 示例
       

摘要

Image-text matching by deep models has recently made remarkable achievements in many tasks, such as image caption and image search. A major challenge of matching the image and text lies in that they usually have complicated underlying relations between them and simply modeling the relations may lead to suboptimal performance. In this paper, we develop a novel approach bi-directional spatial-semantic attention network, which leverages both the word to regions (W2R) relation and visual object to words (O2W) relation in a holistic deep framework for more effectively matching. Specifically, to effectively encode the W2R relation, we adopt LSTM with bilinear attention function to infer the image regions which are more related to the particular words, which is referred as the W2R attention networks. On the other side, the O2W attention networks are proposed to discover the semantically close words for each visual object in the image, i.e., the visual O2W relation. Then, a deep model unifying both of the two directional attention networks into a holistic learning framework is proposed to learn the matching scores of image and text pairs. Compared to the existing image-text matching methods, our approach achieves state-of-the-art performance on the datasets of Flickr30K and MSCOCO.
机译:深度模型的图像文本匹配最近在许多任务中取得了显著成就,例如图像标题和图像搜索。匹配图像和文本的主要挑战在于它们之间通常具有复杂的潜在关系,并且简单地对关系进行建模可能会导致性能欠佳。在本文中,我们开发了一种新颖的双向空间语义注意网络,该网络利用整体深度框架中的词与区域(W2R)关系和视觉对象与词(O2W)关系,以更有效地进行匹配。具体来说,为了有效地编码W2R关系,我们采用具有双线性注意功能的LSTM来推断与特定单词更相关的图像区域,这被称为W2R注意网络。另一方面,提出了O2W注意网络来发现图像中每个视觉对象的语义上接近的词,即视觉O2W关系。然后,提出了将两个方向注意力网络都整合为整体学习框架的深度模型,以学习图像和文本对的匹配分数。与现有的图像文本匹配方法相比,我们的方法在Flickr30K和MSCOCO的数据集上实现了最先进的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号