IEEE Transactions on Knowledge and Data Engineering
Category Alignment Adversarial Learning for Cross-Modal Retrieval

Abstract

Cross-modal retrieval aims to retrieve semantically similar items of one media type given a query from another media type. An intuitive approach is to map data from different media into a common space, where content similarity between different types of data can be measured directly. In this paper, we present a novel method, called Category Alignment Adversarial Learning (CAAL), for cross-modal retrieval. It learns a common representation space, supervised by category information, in which samples from different modalities can be compared directly. Specifically, CAAL first employs two parallel encoders to generate common representations for image and text features, respectively. It then employs two parallel GANs, conditioned on category information, to generate fake image and text features, which are combined with the previously generated embeddings to reconstruct the common representation. Finally, two joint discriminators are used to reduce the gap between the mappings of the first stage and the embeddings of the second stage. Comprehensive experiments on four widely used benchmark datasets demonstrate the superior performance of the proposed method compared with state-of-the-art approaches.
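The core idea of the common-space setup described above can be illustrated with a minimal sketch. This is not the CAAL method itself (the adversarial training, category-conditioned GANs, and joint discriminators are omitted), only the retrieval-time picture: two hypothetical linear encoders project image and text features into a shared space, where cosine similarity ranks candidates. All dimensions and weights here are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions: raw image features, raw text features,
# and the shared embedding space (all chosen arbitrarily here).
D_IMG, D_TXT, D_COMMON = 8, 6, 4

# Two parallel linear "encoders" standing in for the learned networks.
W_img = rng.standard_normal((D_IMG, D_COMMON))
W_txt = rng.standard_normal((D_TXT, D_COMMON))

def encode_image(x):
    """Project an image feature vector into the common space (unit norm)."""
    v = x @ W_img
    return v / np.linalg.norm(v)

def encode_text(t):
    """Project a text feature vector into the same common space."""
    v = t @ W_txt
    return v / np.linalg.norm(v)

def retrieve(query_vec, gallery_vecs):
    """Rank gallery items by cosine similarity to the query."""
    sims = gallery_vecs @ query_vec   # dot product = cosine (unit vectors)
    return np.argsort(-sims)          # best match first

# Toy example: one text query ranked against three encoded images.
images = np.stack([encode_image(rng.standard_normal(D_IMG)) for _ in range(3)])
query = encode_text(rng.standard_normal(D_TXT))
ranking = retrieve(query, images)
print(ranking)
```

In the full method, the encoder weights would be learned jointly under category supervision and the adversarial losses, so that matching image-text pairs end up close in this space; the retrieval step itself stays this simple.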
