...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Local image tagging via graph regularized joint group sparsity
【24h】

Local image tagging via graph regularized joint group sparsity

机译:通过图形正则联合组稀疏性进行局部图像标记

获取原文
获取原文并翻译 | 示例
           

摘要

In recent years, massive amounts of web image data have been emerging on the web. How to precisely label these images is critical and challenging to modern image search engines. Due to the fact that web image contents are more and more complex, existing image-level tagging methods may become less effective and hardly achieve satisfactory performance. This raises an urgent need for the fine-grained tagging, e.g., region-level tagging. In this work, we study how to establish mapping between tags and image regions. In particular, a novel hierarchical local image tagging method is proposed to simultaneously assign tags to all the regions within the same image. We propose a Laplacian Joint Group Lasso (LJGL) model to jointly reconstruct the regions within a test image with a set of labeled training data. The LJGL model not only considers the robust encoding ability of joint group lasso but also preserves local structural information embedded in test regions. Besides, we extend the LJGL model to a kernel version in order to achieve the non-linear reconstruction. An effective algorithm is devised to optimize the objective function of the proposed model. Tags of training data are propagated to the reconstructed regions according to the reconstruction coefficients. Extensive experiments on four public image datasets demonstrate that our proposed models achieve significant performance improvements over the state-of-the-art methods in local image tagging.
机译:近年来,网络上已经出现了大量的网络图像数据。如何精确标记这些图像对于现代图像搜索引擎而言至关重要且具有挑战性。由于Web图像内容越来越复杂的事实,现有的图像级标记方法可能变得无效,并且很难获得令人满意的性能。这就迫切需要细粒度的标记,例如区域级标记。在这项工作中,我们研究如何在标签和图像区域之间建立映射。特别地,提出了一种新颖的分层局部图像标记方法,以将标记同时分配给同一图像内的所有区域。我们提出了拉普拉斯联合组套索(LJGL)模型,以使用一组标记的训练数据共同重建测试图像内的区域。 LJGL模型不仅考虑了联合组套索的强大编码能力,还保留了嵌入测试区域中的局部结构信息。此外,我们将LJGL模型扩展到内核版本,以实现非线性重构。设计了一种有效的算法来优化所提出模型的目标函数。训练数据的标签根据重建系数传播到重建区域。在四个公共图像数据集上进行的大量实验表明,我们提出的模型相对于本地图像标记中的最新方法实现了显着的性能改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号