Multi-hop Interactive Cross-Modal Retrieval

Abstract

Conventional representation learning based cross-modal retrieval approaches always represent the sentence with a global embedding feature, which easily neglects the local correlations between objects in the image and phrases in the sentence. In this paper, we present a novel Multi-hop Interactive Cross-modal Retrieval Model (MICRM), which interactively exploits the local correlations between images and words. We design a multi-hop interactive module to infer the high-order relevance between the image and the sentence. Experimental results on two benchmark datasets, MS-COCO and Flickr30K, demonstrate that our multi-hop interactive model performs significantly better than several competitive cross-modal retrieval methods.

Bibliographic details

Source
International Conference on Multimedia Modeling | 2020 | pp. 681-693 | 13 pages
Authors
Xuecheng Ning; Xiaoshan Yang; Changsheng Xu
Original format: PDF
Keywords
Cross-modal retrieval; Deep learning; LSTMs

Date indexed: 2022-08-26 13:55:05

Similar literature

1. Deep Multiscale Fusion Hashing for Cross-Modal Retrieval [J]. Nie Xiushan, Wang Bowei, Li Jiajia. IEEE Transactions on Circuits and Systems for Video Technology. 2021, issue 1
2. Deep Cross-Modal Face Naming for People News Retrieval [J]. Tian Yong, Zhou Lian, Zhang Yuejie. IEEE Transactions on Knowledge and Data Engineering. 2021, issue 5
3. Comparative analysis on cross-modal information retrieval: A review [J]. Parminder Kaur, Husanbir Singh Pannu, Avleen Kaur Malhi. Computer Science Review. 2021, issue Feba
4. Multi-hop Interactive Cross-Modal Retrieval [C]. Xuecheng Ning, Xiaoshan Yang, Changsheng Xu. International Conference on Multimedia Modeling. 2020
5. Cross-Modal Data Retrieval and Generation Using Deep Neural Networks [D]. Udaiyar, Premkumar. 2020
6. Deep Unsupervised Hashing for Large-Scale Cross-Modal Retrieval Using Knowledge Distillation Model [O]. Mingyong Li, Qiqi Li, Lirong Tang. 2021
7. Peer Review #3 of "Improvement of deep cross-modal retrieval by generating real-valued representation (v0.2)" [O]. 2021
