首页> 美国卫生研究院文献>Scientific Reports >A novel logistic regression model combining semi-supervised learning and active learning for disease classification
【2h】

A novel logistic regression model combining semi-supervised learning and active learning for disease classification

机译:半监督学习与主动学习相结合的新型逻辑回归模型用于疾病分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Traditional supervised learning classifier needs a lot of labeled samples to achieve good performance, however in many biological datasets there is only a small size of labeled samples and the remaining samples are unlabeled. Labeling these unlabeled samples manually is difficult or expensive. Technologies such as active learning and semi-supervised learning have been proposed to utilize the unlabeled samples for improving the model performance. However in active learning the model suffers from being short-sighted or biased and some manual workload is still needed. The semi-supervised learning methods are easy to be affected by the noisy samples. In this paper we propose a novel logistic regression model based on complementarity of active learning and semi-supervised learning, for utilizing the unlabeled samples with least cost to improve the disease classification accuracy. In addition to that, an update pseudo-labeled samples mechanism is designed to reduce the false pseudo-labeled samples. The experiment results show that this new model can achieve better performances compared the widely used semi-supervised learning and active learning methods in disease classification and gene selection.
机译:传统的监督学习分类器需要大量的标记样本才能实现良好的性能,但是在许多生物学数据集中,标记样本的尺寸很小,其余样本未标记。手动标记这些未标记的样品是困难的或昂贵的。已经提出了诸如主动学习和半监督学习之类的技术来利用未标记的样本来改善模型性能。但是,在主动学习中,该模型存在短视或偏见的问题,仍然需要一些手动工作量。半监督学习方法容易受到噪声样本的影响。在本文中,我们提出了一种基于主动学习和半监督学习的互补性的新型逻辑回归模型,用于以最小的成本利用未标记的样本来提高疾病分类的准确性。除此之外,还设计了一种更新的伪标记样本机制,以减少错误的伪标记样本。实验结果表明,与广泛使用的半监督学习和主动学习方法相比,该新模型在疾病分类和基因选择上具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号