Journal: Quantum electronics

Ternary Adversarial Networks With Self-Supervision for Zero-Shot Cross-Modal Retrieval



Abstract

Given a query instance from one modality (e.g., an image), cross-modal retrieval aims to find semantically similar instances in another modality (e.g., text). To perform cross-modal retrieval, existing approaches typically learn a common semantic space from a labeled source set and directly produce representations in that space for the instances of a target set. These methods commonly require that the instances of both sets share the same classes. Consequently, they may not generalize well to the more practical scenario of zero-shot cross-modal retrieval, in which the target set contains unseen classes whose semantics are inconsistent with the seen classes of the source set. Inspired by zero-shot learning, we propose a novel model called ternary adversarial networks with self-supervision (TANSS) to overcome the limitations of existing methods on this challenging task. Our TANSS approach consists of three parallel subnetworks: 1) two semantic feature learning subnetworks that capture the intrinsic data structures of the different modalities and preserve cross-modal relationships via semantic features in the common semantic space; 2) a self-supervised semantic subnetwork that leverages the word vectors of both seen and unseen labels to supervise the semantic feature learning and enhance knowledge transfer to unseen labels; and 3) an adversarial learning scheme that maximizes the consistency and correlation of the semantic features across modalities. The three subnetworks are integrated into an end-to-end network architecture that enables efficient iterative parameter optimization. Comprehensive experiments on three cross-modal datasets show the effectiveness of our TANSS approach compared with state-of-the-art methods for zero-shot cross-modal retrieval.
