首页> 外文会议>International conference on computer analysis of images and patterns >Learning a Limited Text Space for Cross-Media Retrieval
【24h】

Learning a Limited Text Space for Cross-Media Retrieval

机译:学习用于跨媒体检索的有限文本空间

获取原文

摘要

In this paper, we propose a novel model for cross-media retrieval which relies on a limited text space rather than a common space or an image space. More specifically, the model consists of three parts: A visual part that consists of a convolutional neural network and an image understanding network; A language model part that achieves sentence understanding by recurrent neural network; An embedding part that contains a fusion layer to capture both visual label information and semantic correlations between images and sentences, as well as learn the final limited text space by optimizing pairwise ranking loss. Experimental results on three benchmark datasets show that our proposed model gains promising improvement in accuracy for cross-media retrieval especially on sentence retrieval compared with the related state-of-the-art methods.
机译:在本文中,我们提出了一种新颖的跨媒体检索模型,该模型依赖于有限的文本空间而不是公共空间或图像空间。更具体地说,该模型包括三个部分:视觉部分,包括卷积神经网络和图像理解网络;通过递归神经网络实现句子理解的语言模型部分;包含融合层的嵌入部分,可捕获视觉标签信息以及图像和句子之间的语义相关性,以及通过优化成对排名损失来学习最终的受限文本空间。在三个基准数据集上的实验结果表明,与相关的最新方法相比,我们提出的模型在跨媒体检索(尤其是句子检索)的准确性方面有望获得改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号