首页> 外文会议>International Joint Conference on Artificial Intelligence >Knowledge-Embedded Representation Learning for Fine-Grained Image Recognition
【24h】

Knowledge-Embedded Representation Learning for Fine-Grained Image Recognition

机译:知识嵌入式识别图像识别的学习

获取原文

摘要

Humans can naturally understand an image in depth with the aid of rich knowledge accumulated from daily lives or professions. For example, to achieve fine-grained image recognition (e.g., categorizing hundreds of subordinate categories of birds) usually requires a comprehensive visual concept organization including category labels and part-level attributes. In this work, we investigate how to unify rich professional knowledge with deep neural network architectures and propose a Knowledge-Embedded Representation Learning (KERL) framework for handling the problem of fine-grained image recognition. Specifically, we organize the rich visual concepts in the form of knowledge graph and employ a Gated Graph Neural Network to propagate node message through the graph for generating the knowledge representation. By introducing a novel gated mechanism, our KERL framework incorporates this knowledge representation into the discriminative image feature learning, i.e., implicitly associating the specific attributes with the feature maps. Compared with existing methods of fine-grained image classification, our KERL framework has several appealing properties: i) The embedded high-level knowledge enhances the feature representation, thus facilitating distinguishing the subtle differences among subordinate categories. ii) Our framework can learn feature maps with a meaningful configuration that the highlighted regions finely accord with the nodes (specific attributes) of the knowledge graph. Extensive experiments on the widely used Caltech-UCSD bird dataset demonstrate the superiority of our KERL framework over existing state-of-the-art methods.
机译:借助从日常生活或职业累积的丰富知识的帮助,人类可以自然地了解图像。例如,为了实现细粒度的图像识别(例如,分类数百个从属类别的鸟类)通常需要一个全面的视觉概念组织,包括类别标签和部分级属性。在这项工作中,我们调查如何通过深度神经网络架构统一丰富的专业知识,并提出了一种知识嵌入式表示学习(KERL)框架,用于处理细粒度的图像识别问题。具体地,我们以知识图形的形式组织丰富的视觉概念,并采用门控图神经网络通过图形传播节点消息,以生成知识表示。通过引入新颖的门控机制,我们的KERL框架将此知识表示包含到鉴别的图像特征学习中,即,隐式将特定属性与特征映射隐式相关联。与现有的细粒度图像分类方法相比,我们的Kerl框架具有多种吸引力的特性:i)嵌入式高级知识增强了特征表示,从而促进了从属类别之间的微妙差异。 ii)我们的框架可以通过有意义的配置学习具有有意义配置的特征映射,即突出显示的区域与知识图的节点(特定属性)精细符合。在广泛使用的CALTECH-UCSD鸟类数据上的广泛实验展示了在现有最先进的方法上的KERL框架的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号