首页> 外文会议>IEEE International Conference on Image Processing >Inter-Modality Fusion Based Attention for Zero-Shot Cross-Modal Retrieval
【24h】

Inter-Modality Fusion Based Attention for Zero-Shot Cross-Modal Retrieval

机译:基于模态融合的零射跨模态检索注意力

获取原文

摘要

Zero-shot cross-modal retrieval (ZS-CMR) performs the task of cross-modal retrieval where the classes of test categories have a different scope than the training categories. It borrows the intuition from zero-shot learning which targets to transfer the knowledge inferred during the training phase for seen classes to the testing phase for unseen classes. It mimics the real-world scenario where new object categories are continuously populating the multi-media data corpus. Unlike existing ZS-CMR approaches which use generative adversarial networks (GANs) to generate more data, we propose Inter-Modality Fusion based Attention (IMFA) and a framework ZS_INN_FUSE (Zero-Shot cross-modal retrieval using INNer product with image-text FUSEd). It exploits the rich semantics of textual data as guidance to infer additional knowledge during the training phase. This is achieved by generating attention weights through the fusion of image and text modalities to focus on the important regions in an image. We carefully create a zero-shot split based on the large-scale MS-COCO and Flickr30k datasets to perform experiments. The results show that our method achieves improvement over the ZS-CMR baseline and self-attention mechanism, demonstrating the effectiveness of inter-modality fusion in a zero-shot scenario.
机译:零拍摄的跨模型检索(ZS-CMR)执行跨模板检索的任务,其中测试类别的类别与培训类别不同。它借用零射击学习的直觉,该直觉将在训练阶段在训练阶段转移到看不见的阶段的课程中的知识。它模仿现实世界的场景,其中新的对象类别持续填充多媒体数据语料库。与使用生成对冲网络(GANS)生成更多数据的现有ZS-CMR方法不同,我们提出基于模态融合的注意力(IMFA)和框架ZS_INN_FUSE(使用内部产品融合的内部产品零拍摄的跨模型检索)。它利用文本数据的丰富语义作为在培训阶段推断出额外知识的指导。这是通过通过融合图像和文本方式产生注意重量来实现的,以专注于图像中的重要区域。我们仔细创建了基于大型MS-Coco和Flickr30k数据集的零拍分割,以执行实验。结果表明,我们的方法达到了ZS-CMR基线和自我关注机制的改进,展示了零射景中模态融合的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号