首页> 外文期刊>International Journal of Computer Vision >Transferrable Feature and Projection Learning with Class Hierarchy for Zero-Shot Learning
【24h】

Transferrable Feature and Projection Learning with Class Hierarchy for Zero-Shot Learning

机译:零射击学习的类层次结构可转移特征和投影学习

获取原文
获取原文并翻译 | 示例
           

摘要

Zero-shot learning (ZSL) aims to transfer knowledge from seen classes to unseen ones so that the latter can be recognised without any training samples. This is made possible by learning a projection function between a feature space and a semantic space (e.g. attribute space). Considering the seen and unseen classes as two domains, a big domain gap often exists which challenges ZSL. In this work, we propose a novel inductive ZSL model that leverages superclasses as the bridge between seen and unseen classes to narrow the domain gap. Specifically, we first build a class hierarchy of multiple superclass layers and a single class layer, where the superclasses are automatically generated by data-driven clustering over the semantic representations of all seen and unseen class names. We then exploit the superclasses from the class hierarchy to tackle the domain gap challenge in two aspects: deep feature learning and projection function learning. First, to narrow the domain gap in the feature space, we define a recurrent neural network over superclasses and then plug it into a convolutional neural network for enforcing the superclass hierarchy. Second, to further learn a transferrable projection function for ZSL, a novel projection function learning method is proposed by exploiting the superclasses to align the two domains. Importantly, our transferrable feature and projection learning methods can be easily extended to a closely related task-few-shot learning (FSL). Extensive experiments show that the proposed model outperforms the state-of-the-art alternatives in both ZSL and FSL tasks.
机译:零拍摄学习(ZSL)旨在将知识从看到的课程转移到看不见者,以便在没有任何训练样本的情况下可以识别后者。这可以通过学习特征空间和语义空间(例如属性空间)之间的投影函数来实现这一点。考虑到所看到的和看不见的课程作为两个域,大域间隙通常存在挑战ZSL。在这项工作中,我们提出了一种新的电感ZSL模型,使超类作为所看到和看不见的类之间的桥梁来缩小域间隙。具体而言,我们首先构建多个超类层的类层次结构和单个类别层,其中超类是通过数据驱动的聚类自动生成的,通过所有看到的所有所看到和未安装类名的语义表示。然后,我们从班级层次结构中利用超类来解决两个方面的域间隙挑战:深度特征学习和投影功能学习。首先,要缩小特征空间中的域间隙,我们将经常性神经网络定义在超类上,然后将其插入卷积神经网络中,以实施超类层次结构。其次,为了进一步学习用于ZSL的可转移投影功能,通过利用超类来对准两个域来提出一种新的投影函数学习方法。重要的是,我们的可转移功能和投影学习方法可以很容易地扩展到密切相关的任务 - 几次拍摄学习(FSL)。广泛的实验表明,所提出的模型优于ZSL和FSL任务中的最先进的替代品。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号