Image-text bidirectional learning network based cross-modal retrieval

Li Zhuoyi; Lu Huibin; Fu HaoGu Guanghua

首页> 外文期刊>Neurocomputing >Image-text bidirectional learning network based cross-modal retrieval

【24h】

Image-text bidirectional learning network based cross-modal retrieval

机译：Image-text bidirectional learning network based cross-modal retrieval

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

The problem of cross-modal retrieval has attracted significant attention in the cross-media retrieval community. One key challenge of cross-modal retrieval is to eliminate the heterogeneous gap between different patterns. The existing numerous cross-modal retrieval approaches tend to jointly construct a common subspace, while these methods fail to consider mutual influence between modalities sufficiently during the whole training process. In this paper, we propose a novel image-text Bidirectional Learning Network (BLN) based cross-modal retrieval method. The method constructs a common representation space and directly measures the similarity of heterogeneous data. More specifically, a multi-layer supervision network is proposed to learn the cross-modal relevance of the generated representations. Moreover, a bidirectional crisscross loss function is proposed to preserve the modal invariance with the bidirectional learning strategy in the common representation space. The loss functions of discriminant consistency and the bidirectional crisscross loss are integrated into an objective function which aims to minimize the intra-class distance and maximize the inter-class distance. Comprehensive experimental results on four widely-used databases show that the proposed method is effective and superior to the existing cross-modal retrieval methods. (c) 2022 Elsevier B.V. All rights reserved.

著录项

来源
《Neurocomputing》 |2022年第28期|148-159|共12页
作者
Li Zhuoyi; Lu Huibin; Fu HaoGu Guanghua;
展开▼
作者单位

Yanshan Univ, Sch Informat Sci & Engn, Qinhuangdao, Peoples R China|Hebei Key Lab Informat Transmiss & Signal Proc, Qinhuangdao, Hebei, Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种英语
中图分类
关键词
Cross-modal retrieval; bidirectional learning network; common representation space; discriminant consistency loss; bidirectional crisscross loss;

Image-text bidirectional learning network based cross-modal retrieval

摘要

著录项

相关主题

期刊订阅