...
首页> 外文期刊>IEEE Transactions on Image Processing >A Two-Stage Approach to Few-Shot Learning for Image Recognition
【24h】

A Two-Stage Approach to Few-Shot Learning for Image Recognition

机译:对图像识别的几阶段学习的两阶段方法

获取原文
获取原文并翻译 | 示例

摘要

This paper proposes a multi-layer neural network structure for few-shot image recognition of novel categories. The proposed multi-layer neural network architecture encodes transferable knowledge extracted from a large annotated dataset of base categories. This architecture is then applied to novel categories containing only a few samples. The transfer of knowledge is carried out at the feature-extraction and the classification levels distributed across the two training stages. In the first-training stage, we introduce the relative feature to capture the structure of the data as well as obtain a low-dimensional discriminative space. Secondly, we account for the variable variance of different categories by using a network to predict the variance of each class. Classification is then performed by computing the Mahalanobis distance to the mean-class representation in contrast to previous approaches that used the Euclidean distance. In the second-training stage, a category-agnostic mapping is learned from the mean-sample representation to its corresponding class-prototype representation. This is because the mean-sample representation may not accurately represent the novel category prototype. Finally, we evaluate the proposed network structure on four standard few-shot image recognition datasets, where our proposed few-shot learning system produces competitive performance compared to previous work. We also extensively studied and analyzed the contribution of each component of our proposed framework.
机译:本文提出了一种多层神经网络结构,用于几次图像识别新型类别。所提出的多层神经网络架构编码从基本类别的大型注释数据集中提取的可转换知识。然后将该架构应用于仅包含少数样本的新型类别。知识的转移在特征提取和分布在两个训练阶段的分类水平进行。在第一训练阶段,我们介绍了捕获数据结构的相对特征,并获得低维辨别空间。其次,我们通过使用网络来预测每个类的方差来解释不同类别的变量方差。然后通过将Mahalanobis距离计算到平均类表示的距离与使用欧几里德距离的先前方法来执行分类。在第二训练阶段,从平均样本表示到其对应的类原型表示来了解类别无关映射。这是因为平均样本表示可能无法准确表示新颖的类别原型。最后,我们评估了四个标准的少量图像识别数据集上所提出的网络结构,其中我们提出的少量学习系统与以前的工作相比产生竞争性能。我们还广泛地研究并分析了我们提出的框架的每个组成部分的贡献。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号