Learning a Limited Text Space for Cross-Media Retrieval


Abstract

In this paper, we propose a novel model for cross-media retrieval that relies on a limited text space rather than a common space or an image space. The model consists of three parts: a visual part, made up of a convolutional neural network and an image understanding network; a language model part, which achieves sentence understanding with a recurrent neural network; and an embedding part, which contains a fusion layer that captures both visual label information and semantic correlations between images and sentences, and which learns the final limited text space by optimizing a pairwise ranking loss. Experimental results on three benchmark datasets show that the proposed model improves cross-media retrieval accuracy over related state-of-the-art methods, especially on sentence retrieval.
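
The abstract names a pairwise ranking loss as the training objective but gives no formula. The following is a minimal sketch, assuming the common bidirectional max-margin form over cosine similarities. The function names, the margin value of 0.2, and the toy vectors are illustrative assumptions and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def pairwise_ranking_loss(image_vecs, sentence_vecs, margin=0.2):
    """Sum of hinge terms over all mismatched pairs, in both directions.

    Assumption: image_vecs[i] and sentence_vecs[i] form a matching pair,
    and both lists already live in the same embedding space.
    """
    n = len(image_vecs)
    sim = [[cosine(image_vecs[i], sentence_vecs[j]) for j in range(n)]
           for i in range(n)]
    loss = 0.0
    for i in range(n):
        positive = sim[i][i]
        for j in range(n):
            if j == i:
                continue
            # Image i as query: a wrong sentence j should score at least
            # `margin` below the matching sentence.
            loss += max(0.0, margin - positive + sim[i][j])
            # Sentence i as query: a wrong image j should score at least
            # `margin` below the matching image.
            loss += max(0.0, margin - positive + sim[j][i])
    return loss


# Toy data: two images and two sentences in a 2-D space.
images = [[1.0, 0.0], [0.0, 1.0]]
aligned_sentences = [[1.0, 0.0], [0.0, 1.0]]
swapped_sentences = [[0.0, 1.0], [1.0, 0.0]]

loss_aligned = pairwise_ranking_loss(images, aligned_sentences)
loss_swapped = pairwise_ranking_loss(images, swapped_sentences)
```

With the aligned toy pairs every hinge term is zero, while the swapped pairs are penalized in both retrieval directions. This is the behavior a ranking objective of this kind is meant to encourage, whatever exact variant the paper uses.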


Bibliographic details

Source
International Conference on Computer Analysis of Images and Patterns | 2017 | pp. 292-303 | 12 pages
Authors
Zheng Yu; Wenmin Wang; Mengdi Fan
Original format
PDF
Keywords
Cross-media retrieval; Limited text space; Fusion layer; Image understanding network; Recurrent neural network


Similar literature


1. Joint Image-Text Hashing for Fast Large-Scale Cross-Media Retrieval Using Self-Supervised Deep Learning [J]. Gengshen Wu, Jungong Han, Zijia Lin. IEEE Transactions on Industrial Electronics. 2019, No. 12
2. Cross-media retrieval with collective deep semantic learning [J]. Bin Zhang, Lei Zhu, Jiande Sun. Multimedia Tools and Applications. 2018, No. 17
3. Cross-media retrieval based on semi-supervised regularization and correlation learning [J]. Hong Zhang, Gang Dai, Du Tang. Multimedia Tools and Applications. 2018, No. 17
4. Learning a Limited Text Space for Cross-Media Retrieval [C]. Zheng Yu, Wenmin Wang, Mengdi Fan. International Conference on Computer Analysis of Images and Patterns. 2017
5. Learning Robust Visual-Semantic Retrieval Models with Limited Supervision [D]. Mithun, Niluthpol Chowdhury. 2019
6. Can Music Foster Learning – Effects of Different Text Modalities on Learning and Information Retrieval [O]. Janina A. M. Lehmann, Tina Seufert
7. Joint Image-Text Hashing for Fast Large-Scale Cross-Media Retrieval Using Self-Supervised Deep Learning [O]. Gengshen Wu, Jungong Han, Zijia Lin. 2019
