首页> 外文期刊>Pattern recognition letters >Learning label correlations for multi-label image recognition with graph networks
【24h】

Learning label correlations for multi-label image recognition with graph networks

机译:学习与图形网络多标签图像识别的标签相关性

获取原文
获取原文并翻译 | 示例
       

摘要

Multi-label image recognition is a task that predicts a set of object labels in an image. As the objects co-occur in the physical world, it is desirable to model label dependencies. Previous existing methods resort to either recurrent networks or pre-defined label correlation graphs for this purpose. In this paper, instead of using a pre-defined graph which is inflexible and may be sub-optimal for multi-label classification, we propose the A-GCN, which leverages the popular Graph Convolutional Networks with an Adaptive label correlation graph to model label dependencies. Specifically, we introduce a plug-and-play Label Graph (LG) module to learn label correlations with word embeddings, and then utilize traditional GCN to map this graph into label-dependent object classifiers which are further applied to image features. The basic LG module incorporates two 1 x 1 convolutional layers and uses the dot product to generate label graphs. In addition, we propose a sparse correlation constraint to enhance the LG module, and also explore different LG architectures. We validate our method on two diverse multi-label datasets: MS-COCO and Fashion550K. Experimental results show that our A-GCN significantly improves baseline methods and achieves performance superior or comparable to the state of the art. (C) 2020 Elsevier B.V. All rights reserved.
机译:多标签图像识别是一种任务,可预测图像中的一组对象标签。随着物体在物理世界中共同发生的物体,希望建模标签依赖性。以前的现有方法对此目的进行了复发网络或预定义的标签相关图。在本文中,而不是使用更致力的预定义图,并且可以是多标签分类的次优,我们提出了一个-GCN,它利用自适应标签相关图来利用流行的图表卷积网络来模拟标签依赖关系。具体而言,我们介绍了一个即插即用标签图(LG)模块,以学习与Word Embeddings的标签相关性,然后利用传统的GCN将此图映射到依赖于标签的对象分类器,该图形进一步应用于图像特征。基本LG模块包含两个1 x 1卷积层,并使用DOT产品生成标签图。此外,我们提出了一种稀疏的相关约束来增强LG模块,并探索不同的LG架构。我们在两个不同的多标签数据集上验证我们的方法:MS-Coco和Fashion550K。实验结果表明,我们的A-GCN显着提高了基线方法,并实现了与现有技术的优越或相当的性能。 (c)2020 Elsevier B.v.保留所有权利。

著录项

  • 来源
    《Pattern recognition letters》 |2020年第10期|378-384|共7页
  • 作者单位

    Southwest Jiaotong Univ Sch Informat Sci & Technol Chengdu Peoples R China|Chinese Acad Sci Shenzhen Inst Adv Technol Shenzhen Key Lab Comp Vis & Pattern Recognit Shenzhen Peoples R China;

    Chinese Acad Sci Shenzhen Inst Adv Technol Shenzhen Key Lab Comp Vis & Pattern Recognit Shenzhen Peoples R China;

    Chinese Acad Sci Shenzhen Inst Adv Technol Shenzhen Key Lab Comp Vis & Pattern Recognit Shenzhen Peoples R China|Shenzhen Inst Artificial Intelligence & Robot Soc SIAT Branch Shenzhen Peoples R China;

    Southwest Jiaotong Univ Sch Informat Sci & Technol Chengdu Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Multi-label image recognition; Graph convolutional networks; Label correlation graph; Sparse correlation constraint;

    机译:多标签图像识别;图形卷积网络;标签相关图;稀疏相关约束;
  • 入库时间 2022-08-18 21:28:44

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号