首页> 外文期刊>電子情報通信学会技術研究報告. 情報論的学習理論と機械学習 >Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data
【24h】

Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data

机译:基于来自正数据和未标记数据的分类的半监督分类

获取原文
获取原文并翻译 | 示例
       

摘要

Most of the semi-supervised learning methods developed so far use unlabeled data for regularization purposes under particular distributional assumptions such as the manifold assumption. On the other hand, recently developed methods of learning from positive and unlabeled data (PU learning) use unlabeled data for loss evaluation, i.e., label information is directly extracted from unlabeled data. In this paper, we extend PU learning to also incorporate negative data and propose a novel semi-supervised learning approach. We establish a generalization error bound for our novel method and show that the bound decreases with respect to the number of unlabeled data without the distributional assumptions that are required in existing semi-supervised learning methods. Through experiments, we demonstrate the usefulness of the proposed method.
机译:到目前为止,大多数开发的半监督学习方法都是在特定的分布假设(例如流形假设)下将未标记的数据用于正则化目的。另一方面,最近开发的从阳性和未标记数据中学习的方法(PU学习)使用未标记数据进行损失评估,即,直接从未标记数据中提取标记信息。在本文中,我们将PU学习扩展为也包含负数据,并提出了一种新颖的半监督学习方法。我们为我们的新方法建立了一个泛化误差界,并表明该界线相对于未标记数据的数量减少了,而没有现有的半监督学习方法所需的分布假设。通过实验,我们证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号