首页> 外文会议>International conference on multimedia modeling >Recursive Pyramid Network with Joint Attention for Cross-Media Retrieval
【24h】

Recursive Pyramid Network with Joint Attention for Cross-Media Retrieval

机译:联合注意跨媒体检索的递归金字塔网络

获取原文

摘要

Cross-media retrieval has raised wide attention in recent years, for its flexibility in retrieving results across different media types by a query of any media type. Besides studying on the global information of the samples, some recent works focus on the regions of the samples to mine local information for better correlation learning of different media types. However, these works focus on the correlations of regions and sample, while ignoring the correlations between regions, including the significance of each region among all of them, and the supplementary information between the region and its sub-regions, similar to the sample and its regions. For addressing this problem, this paper proposes a new recursive pyramid network with joint attention (RPJA) for cross-media retrieval, which has two main contributions: (1) We repeatedly partition the sample into increasingly fine regions in a pyramid structure, and the representation of sample is generated by modeling the supplementary information, which is provided by the regions and their sub-regions recursively from the bottom to top of pyramid. (2) We propose a joint attention model connecting different media types in each pyramid level, which mines the intra-media information and inter-media correlations to guide the learning of significance of each region, further improving the performance of correlation learning. Experiments on two widely-used datasets compared with state-of-the-art methods verify the effectiveness of our proposed approach.
机译:跨媒体检索近年来获得了广泛的关注,因为它具有通过查询任何媒体类型来检索不同媒体类型的结果的灵活性。除了研究样本的全局信息外,最近的一些工作还集中在样本的区域以挖掘局部信息,以便更好地学习不同媒体类型的相关性。但是,这些工作着重于区域和样本的相关性,而忽略了区域之间的相关性,包括所有区域之间每个区域的重要性以及与样本及其区域相似的区域及其子区域之间的补充信息。地区。为了解决这个问题,本文提出了一种新的具有联合注意力的递归金字塔网络(RPJA),用于跨媒体检索,它有两个主要贡献:(1)我们将样本重复地划分为金字塔结构中越来越细的区域,并且样本表示是通过对补充信息进行建模而生成的,补充信息由区域及其子区域从金字塔的底部到顶部递归提供。 (2)我们提出了一个联合注意模型,该模型将每个金字塔级别的不同媒体类型连接起来,该模型挖掘媒体内信息和媒体间相关性,以指导每个区域重要性的学习,从而进一步提高相关性学习的性能。与最先进的方法相比,在两个广泛使用的数据集上进行的实验证明了我们提出的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号