Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval

Di Wang; Xinbo Gao; Xiumei Wang; Lihuo He; Bo Yuan

首页> 外文期刊>IEEE Transactions on Image Processing >Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval

【24h】

Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval

机译：大规模跨模态检索的多模态判别二进制嵌入

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Multimodal hashing, which conducts effective and efficient nearest neighbor search across heterogeneous data on large-scale multimedia databases, has been attracting increasing interest, given the explosive growth of multimedia content on the Internet. Recent multimodal hashing research mainly aims at learning the compact binary codes to preserve semantic information given by labels. The overwhelming majority of these methods are similarity preserving approaches which approximate pairwise similarity matrix with Hamming distances between the to-be-learnt binary hash codes. However, these methods ignore the discriminative property in hash learning process, which results in hash codes from different classes undistinguished, and therefore reduces the accuracy and robustness for the nearest neighbor search. To this end, we present a novel multimodal hashing method, named multimodal discriminative binary embedding (MDBE), which focuses on learning discriminative hash codes. First, the proposed method formulates the hash function learning in terms of classification, where the binary codes generated by the learned hash functions are expected to be discriminative. And then, it exploits the label information to discover the shared structures inside heterogeneous data. Finally, the learned structures are preserved for hash codes to produce similar binary codes in the same class. Hence, the proposed MDBE can preserve both discriminability and similarity for hash codes, and will enhance retrieval accuracy. Thorough experiments on benchmark data sets demonstrate that the proposed method achieves excellent accuracy and competitive computational efficiency compared with the state-of-the-art methods for large-scale cross-modal retrieval task.

机译：考虑到Internet上多媒体内容的爆炸性增长，多模式哈希在大型多媒体数据库中的异构数据之间进行有效，高效的最近邻搜索，引起了越来越多的兴趣。最近的多峰哈希研究主要旨在学习紧凑的二进制代码以保留标签给出的语义信息。这些方法中的绝大多数是相似性保留方法，其利用要学习的二进制哈希码之间的汉明距离来近似成对相似性矩阵。但是，这些方法忽略了哈希学习过程中的判别属性，这导致无法区分来自不同类别的哈希码，因此降低了最近邻居搜索的准确性和鲁棒性。为此，我们提出了一种新颖的多模式散列方法，称为多模式判别二进制嵌入（MDBE），该方法着重于学习判别性散列码。首先，所提出的方法根据分类表述了哈希函数学习，其中由学习的哈希函数生成的二进制代码预期是有区别的。然后，它利用标签信息发现异构数据内部的共享结构。最后，保留学习的结构用于哈希码，以在同一类中生成相似的二进制代码。因此，提出的MDBE可以保留哈希码的可区分性和相似性，并提高检索的准确性。在基准数据集上进行的充分实验表明，与用于大规模跨模态检索任务的最新方法相比，该方法具有出色的准确性和具有竞争力的计算效率。

著录项

来源
《IEEE Transactions on Image Processing》 |2016年第10期|4540-4554|共15页
作者
Di Wang; Xinbo Gao; Xiumei Wang; Lihuo He; Bo Yuan;
展开▼
作者单位

Video and Image Processing System Laboratory, School of Electronic Engineering, Xidian University, Xi’an, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Multimodal; binary code; cross-modal retrieval; discriminant analysis; hashing;

机译：多模式二进制代码跨模式检索判别分析散列;

相似文献

外文文献
中文文献
专利

1. Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval [J] . Xing Xu, Fumin Shen, Yang Yang, Image Processing, IEEE Transactions on . 2017,第5期

机译：学习判别式二进制代码以进行大规模的跨模态检索
2. Binary Set Embedding for Cross-Modal Retrieval [J] . Mengyang Yu, Li Liu, Ling Shao Neural Networks and Learning Systems, IEEE Transactions on . 2017,第12期

机译：跨模态检索的二进制集嵌入
3. Large-scale document image retrieval and classification with runlength histograms and binary embeddings [J] . Gordo A., Perronnin F., Valveny E. Pattern Recognition: The Journal of the Pattern Recognition Society . 2013,第7期

机译：具有游程直方图和二进制嵌入的大规模文档图像检索和分类
4. Scalable Discriminative Discrete Hashing For Large-Scale Cross-Modal Retrieval [C] . Jianyang Qin, Lunke Fei, Jian Zhu, IEEE International Conference on Acoustics, Speech and Signal Processing . 2021

机译：用于大规模交叉模态检索的可扩展鉴别离散散列
5. Large-scale image retrieval using similarity preserving binary codes [D] . Gong, Yunchao 2014

机译：使用保留相似性的二进制代码进行大规模图像检索
6. Deep Unsupervised Hashing for Large-Scale Cross-Modal Retrieval Using Knowledge Distillation Model [O] . Mingyong Li, Qiqi Li, Lirong Tang, 2021

机译：使用知识蒸馏模型进行大规模交叉模态检索的深度无监督散列
7. FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal Retrieval [O] . Xin Liu, Xingzhi Wang, Yiu-ming Cheung 2021

机译：FDDH：用于大规模交叉模态检索的快速辨别离散散列

Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅