首页> 外文会议>IEEE International Conference on Computer Vision >Learning Query and Image Similarities with Ranking Canonical Correlation Analysis
【24h】

Learning Query and Image Similarities with Ranking Canonical Correlation Analysis

机译:使用排名典范相关分析学习查询和图像相似性

获取原文

摘要

One of the fundamental problems in image search is to learn the ranking functions, i.e., similarity between the query and image. The research on this topic has evolved through two paradigms: feature-based vector model and image ranker learning. The former relies on the image surrounding texts, while the latter learns a ranker based on human labeled query-image pairs. Each of the paradigms has its own limitation. The vector model is sensitive to the quality of text descriptions, and the learning paradigm is difficult to be scaled up as human labeling is always too expensive to obtain. We demonstrate in this paper that the above two limitations can be well mitigated by jointly exploring subspace learning and the use of click-through data. Specifically, we propose a novel Ranking Canonical Correlation Analysis (RCCA) for learning query and image similarities. RCCA initially finds a common subspace between query and image views by maximizing their correlations, and further simultaneously learns a bilinear query-image similarity function and adjusts the subspace to preserve the preference relations implicit in the click-through data. Once the subspace is finalized, query-image similarity can be computed by the bilinear similarity function on their mappings in this subspace. On a large-scale click-based image dataset with 11.7 million queries and one million images, RCCA is shown to be powerful for image search with superior performance over several state-of-the-art methods on both keyword-based and query-by-example tasks.
机译:图像搜索中的一个基本问题是学习排名函数,即查询与图像之间的相似性。对本主题的研究通过两个范式而发展:基于特征的矢量模型和图像排名学习。前者依赖于围绕文本的图像,而后者则基于人类标记的查询图像对的排名。每个范式都有自己的限制。向量模型对文本描述的质量敏感,并且学习范例很难被缩放,因为人类标签总是太昂贵而无法获得。我们在本文中展示,通过共同探索子空间学习和点击数据,可以很好地减轻上述两个限制。具体地,我们提出了一种用于学习查询和图像相似性的新型规范相关分析(RCCA)。 RCCA最初通过最大化其相关性,最初在查询和图像视图之间找到一个常见的子空间,并且进一步同时学习Bilinear查询图像相似函数并调整子空间以保留点击数据中隐式隐式的偏好关系。一旦子空间完成,可以通过此子空间中的映射上的bilinear相似性来计算查询图像相似度。在基于大规模的基于拍摄的图像数据集上,具有1170万个查询和一百万个图像,RCCA被显示为对图像搜索的功能强大,在基于关键字和查询的几种最先进的方法上具有卓越的性能-example任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号