首页> 外文会议>Asian conference on computer vision >Understanding Convolutional Neural Networks in Terms of Category-Level Attributes
【24h】

Understanding Convolutional Neural Networks in Terms of Category-Level Attributes

机译:从类别级属性了解卷积神经网络

获取原文

摘要

It has been recently reported that convolutional neural networks (CNNs) show good performances in many image recognition tasks. They significantly outperform the previous approaches that are not based on neural networks particularly for object category recognition. These performances are arguably owing to their ability of discovering better image features for recognition tasks through learning, resulting in the acquisition of better internal representations of the inputs. However, in spite of the good performances, it remains an open question why CNNs work so well and/or how they can learn such good representations. In this study, we conjecture that the learned representation can be interpreted as category-level attributes that have good properties. We conducted several experiments by using the dataset AwA (Animals with Attributes) and a CNN trained for ILSVRC-2012 in a fully supervised setting to examine this conjecture. We report that there exist units in the CNN that can predict some of the 85 semantic attributes fairly accurately, along with a detailed observation that this is true only for visual attributes and not for non-visual ones. It is more natural to think that the CNN may discover not only semantic attributes but non-semantic ones (or ones that are difficult to represent as a word). To explore this possibility, we perform zero-shot learning by regarding the activation pattern of upper layers as attributes describing the categories. The result shows that it outperforms the state-of-the-art with a significant margin.
机译:最近有报道说,卷积神经网络(CNN)在许多图像识别任务中表现出良好的性能。它们明显优于以前的基于神经网络的方法,尤其是在对象类别识别方面。这些性能可以说是由于它们有能力通过学习发现用于识别任务的更好的图像特征,从而获得了更好的输入内部表示。但是,尽管表现良好,但CNN为何如此运作良好和/或如何学习这种良好的表示形式仍是一个悬而未决的问题。在这项研究中,我们推测学习的表示形式可以解释为具有良好属性的类别级属性。我们通过使用数据集AwA(具有属性的动物)和为ILSVRC-2012训练的CNN在完全监督的情况下进行了几次实验,以检验这一猜想。我们报告CNN中存在可以相当准确地预测85个语义属性中的一些属性的单元,同时还详细观察到,这仅对视觉属性适用,而对非视觉属性则适用。更自然地认为CNN不仅可以发现语义属性,而且可以发现非语义属性(或难以用单词表示的属性)。为了探索这种可能性,我们通过将上层的激活模式视为描述类别的属性来执行零射学习。结果表明,它以最大的幅度超过了最新技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号