首页> 外文会议>IEEE International Conference on Data Mining >Similarity-based Active Learning for Image Classification under Class Imbalance
【24h】

Similarity-based Active Learning for Image Classification under Class Imbalance

机译:基于相似性的主动学习,用于在类别不平衡下的图像分类

获取原文
获取外文期刊封面目录资料

摘要

Many image classification tasks (e.g., medical image classification) have a severe class imbalance problem. Convolutional neural network (CNN) is currently a state-of-the-art method for image classification. CNN relies on a large training dataset to achieve high classification performance. However, manual labeling is costly and may not even be feasible for the medical domain. In this paper, we propose a novel similarity-based active deep learning framework (SAL) that deals with class imbalance. SAL actively learns a similarity model to recommend unlabeled rare class samples for experts' manual labeling. Based on similarity ranking, SAL recommends with high confidence unlabeled common class samples for automatic pseudo-labeling without requiring labeling effort by experts. Our experiments show that SAL consistently outperforms two other recent active deep learning methods on two challenging datasets. In addition, SAL obtains nearly the upper bound classification performance (using all the images in the training dataset) for an Endoscopy and the Caltech-256 datasets while the domain experts labeled only 5.6% and 7.5% of all images, respectively. To the best of our knowledge, SAL is the first active deep learning framework that deals with a significant class imbalance and significantly reduces manual labeling efforts by experts while achieving near optimal classification performance.
机译:许多图像分类任务(例如,医学图像分类)有严重的类不平衡问题。卷积神经网络(CNN)是目前用于图像分类的状态下的最先进的方法。 CNN依靠大量训练数据集,以实现高分类性能。然而,手动贴标签是昂贵的并且可能甚至不用于医学领域可行的。在本文中,我们提出了一种新的基于相似性主动深度学习框架(SAL)与类不平衡的交易。 SAL主动获知相似模型,建议未标记的稀有类样本进行专家手工贴标。基于相似性排名,SAL建议以高可信度未标记的通用类样品自动伪标签,而不由专家要求贴标工作。我们的实验表明,SAL一贯优于其他两个最近主动深度学习上的两个挑战数据集的方法。此外,SAL近获得上界的分类性能(使用中的所有训练数据集的图像)用于内窥镜和加州理工学院的-256数据集,而领域专家分别标记为仅5.6%和所有图像的7.5%。据我们所知,SAL是第一个主动深度学习框架,具有显著类不平衡优惠和显著专家降低了手工贴标工作,同时实现了接近最优的分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号