首页> 外文期刊>International journal of organizational and collective intelligence >A Prediction Approach Based on Self-Training and Deep Learning for Biological Data
【24h】

A Prediction Approach Based on Self-Training and Deep Learning for Biological Data

机译:基于自培训和生物数据深度学习的预测方法

获取原文
获取原文并翻译 | 示例
       

摘要

With the exponential growth of biological data, labeling this kind of data becomes difficult and costly. Although unlabeled data are comparatively more plentiful than labeled ones, most supervised learning methods are not designed to use unlabeled data. Semi-supervised learning methods are motivated by the availability of large unlabeled datasets rather than a small amount of labeled examples. However, incorporating unlabeled data into learning does not guarantee an improvement in classification performance. This paper introduces an approach based on a model of semi-supervised learning, which is the self-training with a deep learning algorithm to predict missing classes from labeled and unlabeled data. In order to assess the performance of the proposed approach, two datasets are used with four performance measures: precision, recall, F-measure, and area under the ROC curve (AUC).
机译:随着生物数据的指数增长,标记这种数据变得困难且昂贵。虽然未标记的数据比标记为更丰富,但大多数监督的学习方法都不设计用于使用未标记的数据。半监督学习方法是通过大型未标记数据集的可用性而不是少量标记的例子。但是,将未标记的数据纳入学习并不能保证提高分类性能。本文介绍了一种基于半监督学习模型的方法,这是具有深度学习算法的自我训练,以预测来自标记和未标记数据的丢失类。为了评估所提出的方法的性能,两个数据集配有四种性能措施:精度,召回,F测量和ROC曲线(AUC)下的区域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号