首页> 外文期刊>Image Processing, IEEE Transactions on >Image Annotation by Input–Output Structural Grouping Sparsity
【24h】

Image Annotation by Input–Output Structural Grouping Sparsity

机译:输入-输出结构分组稀疏度的图像注释

获取原文
获取原文并翻译 | 示例
       

摘要

Automatic image annotation (AIA) is very important to image retrieval and image understanding. Two key issues in AIA are explored in detail in this paper, i.e., structured visual feature selection and the implementation of hierarchical correlated structures among multiple tags to boost the performance of image annotation. This paper simultaneously introduces an input and output structural grouping sparsity into a regularized regression model for image annotation. For input high-dimensional heterogeneous features such as color, texture, and shape, different kinds (groups) of features have different intrinsic discriminative power for the recognition of certain concepts. The proposed structured feature selection by structural grouping sparsity can be used not only to select group-of-features but also to conduct within-group selection. Hierarchical correlations among output labels are well represented by a tree structure, and therefore, the proposed tree-structured grouping sparsity can be used to boost the performance of multitag image annotation. In order to efficiently solve the proposed regression model, we relax the solving process as a framework of the bilayer regression model for multilabel boosting by the selection of heterogeneous features with structural grouping sparsity (Bi-MtBGS). The first-layer regression is to select the discriminative features for each label. The aim of the second-layer regression is to refine the feature selection model learned from the first layer, which can be taken as a multilabel boosting process. Extensive experiments on public benchmark image data sets and real-world image data sets demonstrate that the proposed approach has better performance of multitag image annotation and leads to a quite interpretable model for image understanding.
机译:自动图像注释(AIA)对于图像检索和图像理解非常重要。本文详细探讨了友邦保险中的两个关键问题,即结构化视觉特征选择和多个标签之间的分层相关结构的实现,以提高图像注释的性能。本文同时将输入和输出结构分组稀疏性引入到用于图像标注的正则化回归模型中。对于输入的高维异构特征(例如颜色,纹理和形状),不同种类(组)的特征对于识别某些概念具有不同的内在判别能力。通过结构分组稀疏性提出的结构化特征选择不仅可以用于选择特征组,还可以用于进行组内选择。输出标签之间的层次相关性很好地由树结构表示,因此,所提出的树结构分组稀疏性可用于提高多标签图像注释的性能。为了有效地解决建议的回归模型,我们通过选择具有结构分组稀疏性的异类特征(Bi-MtBGS)来放松求解过程,将其作为双层回归模型的框架进行多标签增强。第一层回归是为每个标签选择判别特征。第二层回归的目的是完善从第一层学到的特征选择模型,该模型可以被视为多标签增强过程。在公共基准图像数据集和真实世界图像数据集上的大量实验表明,该方法具有更好的多标签图像注释性能,并为图像理解提供了一个可解释的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号