首页> 外文期刊>Multimedia Systems >Sparse semantic metric learning for image retrieval
【24h】

Sparse semantic metric learning for image retrieval

机译:图像检索的稀疏语义度量学习

获取原文
获取原文并翻译 | 示例
       

摘要

Typical content-based image retrieval solutions usually cannot achieve satisfactory performance due to the semantic gap challenge. With the popularity of social media applications, large amounts of social images associated with user tagging information are available, which can be leveraged to boost image retrieval. In this paper, we propose a sparse semantic metric learning (SSML) algorithm by discovering knowledge from these social media resources, and apply the learned metric to search relevant images for users. Different from the traditional metric learning approaches that use similar or dissimilar constraints over a homogeneous visual space, the proposed method exploits heterogeneous information from two views of images and formulates the learning problem with the following principles. The semantic structure in the text space is expected to be preserved for the transformed space. To prevent overfitting the noisy, incomplete, or subjective tagging information of images, we expect that the mapping space by the learned metric does not deviate from the original visual space. In addition, the metric is straightforward constrained to be row-wise sparse with the ℓ_(2,1)-norm to suppress certain noisy or redundant visual feature dimensions. We present an iterative algorithm with proved convergence to solve the optimization problem. With the learned metric for image retrieval, we conduct extensive experiments on a real-world dataset and validate the effectiveness of our approach compared with other related work.
机译:由于语义差距的挑战,典型的基于内容的图像检索解决方案通常无法获得令人满意的性能。随着社交媒体应用程序的普及,与用户标记信息关联的大量社交图像可用,可以利用这些社交图像来促进图像检索。在本文中,我们通过从这些社交媒体资源中发现知识,提出了一种稀疏语义度量学习(SSML)算法,并将该学习的度量应用于搜索用户的相关图像。与传统的度量学习方法不同,传统的度量学习方法在同质的视觉空间上使用相似或不相似的约束,所提出的方法从图像的两个视图中利用异构信息,并根据以下原理来阐述学习问题。文本空间中的语义结构预计将保留给转换后的空间。为了防止过度拟合图像的嘈杂,不完整或主观标签信息,我们希望通过学习的指标得出的映射空间不会偏离原始视觉空间。另外,该度量直接约束为ℓ_(2,1)-范数行稀疏,以抑制某些嘈杂或多余的视觉特征尺寸。我们提出了一种迭代算法,证明了收敛性,可以解决优化问题。借助所学的图像检索指标,我们对真实数据集进行了广泛的实验,并与其他相关工作相比,验证了我们方法的有效性。

著录项

  • 来源
    《Multimedia Systems》 |2014年第6期|635-643|共9页
  • 作者

    Jing Liu; Zechao Li; Hanqing Lu;

  • 作者单位

    National Lab. of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, No. 95, Zhongguancun East Road, Beijing 100190, People's Republic of China;

    National Lab. of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, No. 95, Zhongguancun East Road, Beijing 100190, People's Republic of China;

    National Lab. of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, No. 95, Zhongguancun East Road, Beijing 100190, People's Republic of China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Sparse metric; Semantic distance metric; Social image; Image retrieval;

    机译:稀疏度量;语义距离度量;社会形象;图像检索;
  • 入库时间 2022-08-18 02:06:16

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号