首页> 外文OA文献 >Multi-modal curriculum learning for semi-supervised image classification
【2h】

Multi-modal curriculum learning for semi-supervised image classification

机译:用于半监督图像分类的多模式课程学习

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Semi-supervised image classification aims to classify a large quantity of unlabeled images by typically harnessing scarce labeled images. Existing semi-supervised methods often suffer from inadequate classification accuracy when encountering difficult yet critical images, such as outliers, because they treat all unlabeled images equally and conduct classifications in an imperfectly ordered sequence. In this paper, we employ the curriculum learning methodology by investigating the difficulty of classifying every unlabeled image. The reliability and the discriminability of these unlabeled images are particularly investigated for evaluating their difficulty. As a result, an optimized image sequence is generated during the iterative propagations, and the unlabeled images are logically classified from simple to difficult. Furthermore, since images are usually characterized by multiple visual feature descriptors, we associate each kind of features with a teacher, and design a multi-modal curriculum learning (MMCL) strategy to integrate the information from different feature modalities. In each propagation, each teacher analyzes the difficulties of the currently unlabeled images from its own modality viewpoint. A consensus is subsequently reached among all the teachers, determining the currently simplest images (i.e., a curriculum), which are to be reliably classified by the multi-modal learner. This well-organized propagation process leveraging multiple teachers and one learner enables our MMCL to outperform five state-of-the-art methods on eight popular image data sets.
机译:半监督图像分类旨在通过通常利用稀缺的标记图像来对大量未标记图像进行分类。现有的半监督方法在遇到困难而又关键的图像(例如离群值)时,常常会遇到分类精度不足的问题,因为它们会平等地对待所有未标记的图像,并以不完全有序的顺序进行分类。在本文中,我们通过调查分类每个未标记图像的难度来采用课程学习方法。对这些未标记图像的可靠性和可分辨性进行了专门研究,以评估其难度。结果,在迭代传播期间生成了优化的图像序列,并且未标记的图像在逻辑上从简单到困难进行了分类。此外,由于图像通常由多个视觉特征描述符表征,因此我们将每种特征与老师关联,并设计一种多模式课程学习(MMCL)策略以整合来自不同特征模态的信息。在每次传播中,每位教师都从其自身的模态角度分析当前未标记图像的困难。随后在所有教师中达成共识,确定当前最简单的图像(即课程表),这些图像将由多模式学习者可靠地分类。这种组织有序的传播过程利用了多位老师和一名学习者的力量,使我们的MMCL在八种流行的图像数据集上表现出五种最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号