首页> 外文期刊>Scientific reports. >A Cluster-then-label Semi-supervised Learning Approach for Pathology Image Classification
【24h】

A Cluster-then-label Semi-supervised Learning Approach for Pathology Image Classification

机译:一种分类标签半监督学习的病理图像分类方法

获取原文
           

摘要

Completely labeled pathology datasets are often challenging and time-consuming to obtain. Semi-supervised learning (SSL) methods are able to learn from fewer labeled data points with the help of a large number of unlabeled data points. In this paper, we investigated the possibility of using clustering analysis to identify the underlying structure of the data space for SSL. A cluster-then-label method was proposed to identify high-density regions in the data space which were then used to help a supervised SVM in finding the decision boundary. We have compared our method with other supervised and semi-supervised state-of-the-art techniques using two different classification tasks applied to breast pathology datasets. We found that compared with other state-of-the-art supervised and semi-supervised methods, our SSL method is able to improve classification performance when a limited number of labeled data instances are made available. We also showed that it is important to examine the underlying distribution of the data space before applying SSL techniques to ensure semi-supervised learning assumptions are not violated by the data.
机译:完整标记的病理数据集通常具有挑战性,而且很耗时。半监督学习(SSL)方法能够借助大量未标记的数据点从较少的标记数据点学习。在本文中,我们研究了使用聚类分析来确定SSL数据空间的基础结构的可能性。提出了一种“聚簇再标记”的方法来识别数据空间中的高密度区域,然后将其用于帮助受监督的SVM查找决策边界。我们将两种方法应用于乳腺病理数据集,将我们的方法与其他有监督和半监督的最新技术进行了比较。我们发现,与其他最新的监督和半监督方法相比,当有限数量的标记数据实例可用时,我们的SSL方法能够提高分类性能。我们还表明,在应用SSL技术之前检查数据空间的基础分布很重要,以确保数据不违反半监督学习假设。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号