首页> 外文会议>Chinese conference on pattern recognition and computer vision >Modality Consistent Generative Adversarial Network for Cross-Modal Retrieval
【24h】

Modality Consistent Generative Adversarial Network for Cross-Modal Retrieval

机译:跨模态检索的模态一致生成对抗网络

获取原文

摘要

Cross-modal retrieval, which aims to perform the retrieval task across different modalities of data, is a hot topic. Since different modalities of data have inconsistent distributions, how to reduce the gap of different modalities is the core of cross-modal retrieval issue. Recently, Generative Adversarial Networks has been used in cross-modal retrieval due to its strong ability to model data distribution. We propose a novel approach named Modality Consistent Generative Adversarial Network for cross-modal retrieval (MCGAN). The network integrates a generator to generate synthetic image features from text features, a discriminator to classify the modality of features, and followed by a modality consistent embedding network that projects the generated image features and real image features into a common space for learning the discriminative representations. Experiments on two datasets prove the performance of MCGAN on cross-modal retrieval, compared with state-of-the-art related works.
机译:跨模式检索旨在跨数据的不同模式执行检索任务,这是一个热门话题。由于不同形式的数据具有不一致的分布,如何减少不同形式的差异是交叉模式检索问题的核心。近年来,由于其强大的数据分布建模能力,Generative Adversarial Networks已被用于交叉模式检索。我们提出了一种新颖的方法,用于跨模式检索(MCGAN)的模式一致性生成对抗网络。该网络集成了一个生成器,用于从文本特征生成合成图像特征;一个鉴别器,用于对特征模态进行分类;随后是一个模态一致性嵌入网络,该网络将生成的图像特征和真实图像特征投影到一个公共空间中,以学习判别式表示形式。 。与最新的相关工作相比,在两个数据集上的实验证明了MCGAN在交叉模式检索上的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号