...
首页> 外文期刊>Multimedia, IEEE Transactions on >Learning Consistent Feature Representation for Cross-Modal Multimedia Retrieval
【24h】

Learning Consistent Feature Representation for Cross-Modal Multimedia Retrieval

机译:跨模态多媒体检索的学习一致特征表示

获取原文
获取原文并翻译 | 示例
           

摘要

The cross-modal feature matching has gained much attention in recent years, which has many practical applications, such as the text-to-image retrieval. The most difficult problem of cross-modal matching is how to eliminate the heterogeneity between modalities. The existing methods (e.g., CCA and PLS) try to learn a common latent subspace, where the heterogeneity between two modalities is minimized so that cross-matching is possible. However, most of these methods require fully paired samples and suffer difficulties when dealing with unpaired data. Besides, utilizing the class label information has been found as a good way to reduce the semantic gap between the low-level image features and high-level document descriptions. Considering this, we propose a novel and effective supervised algorithm, which can also deal with the unpaired data. In the proposed formulation, the basis matrices of different modalities are jointly learned based on the training samples. Moreover, a local group-based priori is proposed in the formulation to make a better use of popular block based features (e.g., HOG and GIST). Extensive experiments are conducted on four public databases: Pascal VOC2007, LabelMe, Wikipedia, and NUS-WIDE. We also evaluated the proposed algorithm with unpaired data. By comparing with existing state-of-the-art algorithms, the results show that the proposed algorithm is more robust and achieves the best performance, which outperforms the second best algorithm by about 5% on both the Pascal VOC2007 and NUS-WIDE databases.
机译:跨模式特征匹配近年来受到了广泛的关注,其具有许多实际应用,例如文本到图像的检索。跨模态匹配最困难的问题是如何消除模态之间的异质性。现有方法(例如CCA和PLS)试图学习一个共同的潜在子空间,其中两个模态之间的异质性被最小化,从而可以进行交叉匹配。但是,这些方法大多数都需要完全配对的样本,并且在处理未配对的数据时会遇到困难。此外,已经发现利用类别标签信息是减小低级图像特征和高级文档描述之间的语义鸿沟的一种好方法。考虑到这一点,我们提出了一种新颖有效的监督算法,该算法也可以处理未配对的数据。在提出的公式中,基于训练样本共同学习了不同模态的基础矩阵。此外,在配方中提出了基于局部组的先验,以更好地利用流行的基于块的特征(例如,HOG和GIST)。在四个公共数据库上进行了广泛的实验:Pascal VOC2007,LabelMe,Wikipedia和NUS-WIDE。我们还用不成对的数据评估了提出的算法。通过与现有的最新算法进行比较,结果表明,所提出的算法更加健壮,并实现了最佳性能,在Pascal VOC2007和NUS-WIDE数据库上均比次优算法高出约5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号