首页> 外文期刊>IEEE Transactions on Image Processing >Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval
【24h】

Multimodal Discriminative Binary Embedding for Large-Scale Cross-Modal Retrieval

机译:大规模跨模态检索的多模态判别二进制嵌入

获取原文
获取原文并翻译 | 示例
       

摘要

Multimodal hashing, which conducts effective and efficient nearest neighbor search across heterogeneous data on large-scale multimedia databases, has been attracting increasing interest, given the explosive growth of multimedia content on the Internet. Recent multimodal hashing research mainly aims at learning the compact binary codes to preserve semantic information given by labels. The overwhelming majority of these methods are similarity preserving approaches which approximate pairwise similarity matrix with Hamming distances between the to-be-learnt binary hash codes. However, these methods ignore the discriminative property in hash learning process, which results in hash codes from different classes undistinguished, and therefore reduces the accuracy and robustness for the nearest neighbor search. To this end, we present a novel multimodal hashing method, named multimodal discriminative binary embedding (MDBE), which focuses on learning discriminative hash codes. First, the proposed method formulates the hash function learning in terms of classification, where the binary codes generated by the learned hash functions are expected to be discriminative. And then, it exploits the label information to discover the shared structures inside heterogeneous data. Finally, the learned structures are preserved for hash codes to produce similar binary codes in the same class. Hence, the proposed MDBE can preserve both discriminability and similarity for hash codes, and will enhance retrieval accuracy. Thorough experiments on benchmark data sets demonstrate that the proposed method achieves excellent accuracy and competitive computational efficiency compared with the state-of-the-art methods for large-scale cross-modal retrieval task.
机译:考虑到Internet上多媒体内容的爆炸性增长,多模式哈希在大型多媒体数据库中的异构数据之间进行有效,高效的最近邻搜索,引起了越来越多的兴趣。最近的多峰哈希研究主要旨在学习紧凑的二进制代码以保留标签给出的语义信息。这些方法中的绝大多数是相似性保留方法,其利用要学习的二进制哈希码之间的汉明距离来近似成对相似性矩阵。但是,这些方法忽略了哈希学习过程中的判别属性,这导致无法区分来自不同类别的哈希码,因此降低了最近邻居搜索的准确性和鲁棒性。为此,我们提出了一种新颖的多模式散列方法,称为多模式判别二进制嵌入(MDBE),该方法着重于学习判别性散列码。首先,所提出的方法根据分类表述了哈希函数学习,其中由学习的哈希函数生成的二进制代码预期是有区别的。然后,它利用标签信息发现异构数据内部的共享结构。最后,保留学习的结构用于哈希码,以在同一类中生成相似的二进制代码。因此,提出的MDBE可以保留哈希码的可区分性和相似性,并提高检索的准确性。在基准数据集上进行的充分实验表明,与用于大规模跨模态检索任务的最新方法相比,该方法具有出色的准确性和具有竞争力的计算效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号