首页> 外文期刊>Multimedia Tools and Applications >Semantically-enhanced kernel canonical correlation analysis: a multi-label cross-modal retrieval
【24h】

Semantically-enhanced kernel canonical correlation analysis: a multi-label cross-modal retrieval

机译:语义增强的核规范相关分析:多标签交叉模式检索

获取原文
获取原文并翻译 | 示例
           

摘要

Aiming at measuring the inter-media semantic similarities, cross-modal retrieval tries to align heterogenous features to an intermediate common subspace in which they can be reasonably compared. This is based on the same understanding of the semantics which are represented by different modalities. However, the semantics can usually be reflected by multiple concepts since concepts co-occur in real-world rather than occur in isolation. This leads to a more challenging task of multi-label cross-modal retrieval in which multiple concepts are annotated as labels for images as an example. More importantly, the co-occurrence patterns of concepts result in correlated pairs of labels whose relationships need to be considered in an accurate cross-modal retrieval. In this paper, we propose multi-label kernel canonical correlation analysis (ml-KCCA), a novel approach for cross-modal retrieval which enhances kernel CCA with high-level semantic information reflected in multi-label annotations. By kernelizing correlation extraction from multi-label information, more complex non-linear correlations between different modalities can be measured in order to learn a discriminative subspace which is more suitable for cross-modal retrieval tasks. Extensive evaluations on public datasets have validated the improvements of our approach over the state-of-the-art cross-modal retrieval approaches including other CCA extensions.
机译:为了测量媒体之间的语义相似性,跨模式检索尝试将异构特征与中间公共子空间对齐,在中间子空间中可以合理地比较它们。这是基于对不同形式所代表的语义的相同理解。但是,语义通常可以由多个概念反映,因为概念在现实世界中同时发生,而不是孤立地出现。这导致了更具挑战性的多标签交叉模式检索任务,其中将多个概念标注为图像标签作为示例。更重要的是,概念的共现模式导致标签的相关对,在精确的跨模式检索中需要考虑它们之间的关系。在本文中,我们提出了多标签内核规范相关分析(ml-KCCA),这是一种用于跨模式检索的新方法,该方法利用多标签注释中反映的高级语义信息增强了内核CCA。通过对从多标签信息中提取相关性进行核化,可以测量不同模态之间更复杂的非线性相关性,以便学习更适合跨模态检索任务的判别子空间。对公共数据集的广泛评估已经验证了我们的方法相对于最新的交叉模式检索方法(包括其他CCA扩展)的改进。

著录项

  • 来源
    《Multimedia Tools and Applications》 |2019年第10期|13169-13188|共20页
  • 作者单位

    Natl Univ Def Technol, Sci & Technol Informat Syst Engn Lab, Changsha 410073, Hunan, Peoples R China;

    Natl Univ Def Technol, Sci & Technol Informat Syst Engn Lab, Changsha 410073, Hunan, Peoples R China;

    Natl Univ Def Technol, Sci & Technol Informat Syst Engn Lab, Changsha 410073, Hunan, Peoples R China;

    Tsinghua Univ, Dept Comp Sci & Technol, Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China;

    Natl Univ Def Technol, Sci & Technol Informat Syst Engn Lab, Changsha 410073, Hunan, Peoples R China;

    Natl Univ Def Technol, Sci & Technol Informat Syst Engn Lab, Changsha 410073, Hunan, Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Cross-modal retrieval; Kernel CCA; Multi-label information; Concept correlations;

    机译:跨模式检索;内核CCA;多标签信息;概念相关;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号