首页> 外文期刊>International Journal of Multimedia Information Retrieval >Optimizing visual dictionaries for effective image retrieval
【24h】

Optimizing visual dictionaries for effective image retrieval

机译:优化视觉词典以有效检索图像

获取原文
获取原文并翻译 | 示例
       

摘要

Characterizing images by high-level concepts from a learned visual dictionary is extensively used in image classification and retrieval. This paper deals with inferring discriminative visual dictionaries for effective image retrieval and examines a non-negative visual dictionary learning scheme towards this direction. More specifically, a nonnegative matrix factorization framework with ?_0-sparseness constraint on the coefficient matrix for optimizing the dictionary is proposed. It is a two-step iterative process composed of sparse encoding and dictionary enhancement stages. An initial estimate of the visual dictionary is updated in each iteration with the proposed ?_0-constraint gradient projection algorithm.Adesirable attribute of this formulation is an adaptive sequential dictionary initialization procedure. This leads to a sharp drop down of the approximation error and a faster convergence. Finally, the proposed dictionary optimization scheme is used to derive a compact image representation for the retrieval task. A new image signature is obtained by projecting local descriptors on to the basis elements of the optimized visual dictionary and then aggregating the resulting sparse encodings in to a single feature vector. Experimental results on various benchmark datasets show that the proposed system can infer enhanced visual dictionaries and the derived image feature vector can achieve better retrieval results as compared to state-of-the-art techniques.
机译:通过从学习的视觉词典中获得的高级概念来表征图像,已广泛用于图像分类和检索。本文介绍了可区分的视觉词典的有效检索方法,并研究了朝这一方向发展的非负视觉词典学习方案。更具体地说,提出了一种在系数矩阵上具有α_0稀疏约束的非负矩阵分解框架,用于优化字典。这是一个两步的迭代过程,由稀疏编码和字典增强阶段组成。视觉字典的初始估计值在每次迭代中都使用提出的α_0约束梯度投影算法进行更新。此公式的理想属性是自适应顺序字典初始化过程。这导致逼近误差急剧下降,收敛速度更快。最后,提出的字典优化方案用于导出检索任务的紧凑图像表示。通过将局部描述符投影到优化的视觉词典的基本元素上,然后将所得的稀疏编码聚合到单个特征向量中,可以获取新的图像签名。在各种基准数据集上的实验结果表明,与最新技术相比,该系统可以推断出增强的视觉词典,并且导出的图像特征向量可以实现更好的检索结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号