首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data
【24h】

Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data

机译:基于来自正数据和未标记数据的分类的半监督分类

获取原文
       

摘要

Most of the semi-supervised classification methods developed so far use unlabeled data for regularization purposes under particular distributional assumptions such as the cluster assumption. In contrast, recently developed methods of classification from positive and unlabeled data (PU classification) use unlabeled data for risk evaluation, i.e., label information is directly extracted from unlabeled data. In this paper, we extend PU classification to also incorporate negative data and propose a novel semi-supervised learning approach. We establish generalization error bounds for our novel methods and show that the bounds decrease with respect to the number of unlabeled data without the distributional assumptions that are required in existing semi-supervised learning methods. Through experiments, we demonstrate the usefulness of the proposed methods.
机译:迄今为止,大多数半监督分类方法在特定的分布假设(例如聚类假设)下都使用未标记的数据进行正则化。相反,最近开发的从阳性和未标记数据分类的方法(PU分类)使用未标记数据进行风险评估,即,从未标记数据直接提取标签信息。在本文中,我们将PU分类扩展到也包含负数据,并提出了一种新颖的半监督学习方法。我们为我们的新方法建立了泛化误差界,并表明,相对于未标记数据的数量而言,界线减小了,而没有现有的半监督学习方法所需的分布假设。通过实验,我们证明了所提出方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号