Conference: International Joint Conference on Artificial Intelligence

Similarity-Based Approach for Positive and Unlabelled Learning


Abstract

Positive and unlabelled learning (PU learning) addresses the setting in which only positive examples and unlabelled examples are available. Most previous work focuses on identifying some negative examples in the unlabelled data, so that supervised learning methods can be applied to build a classifier. However, the remaining unlabelled examples, which cannot be explicitly identified as positive or negative (we call them ambiguous examples), are either excluded from the training phase or simply forced into one of the two classes. Consequently, the performance of these methods may be limited. This paper proposes a novel approach, called the similarity-based PU learning (SPUL) method, which associates each ambiguous example with two similarity weights indicating its similarity to the positive class and to the negative class, respectively. Local similarity-based and global similarity-based mechanisms are proposed to generate the similarity weights. The ambiguous examples and their similarity weights are then incorporated into an SVM-based learning phase to build a more accurate classifier. Extensive experiments on real-world datasets show that SPUL outperforms state-of-the-art PU learning methods.
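
The abstract does not give the formulas behind the similarity weights or the SVM objective, so the following is only a rough sketch of the general idea under stated assumptions. The function names, the k-nearest-neighbour weighting rule, the linear model, and the toy data are all hypothetical and are not taken from the paper. The sketch gives each ambiguous example a pair of weights (towards the positive and the negative class) and lets it contribute to a hinge-loss objective once per class, scaled by the corresponding weight.

```python
import math

def dist(a, b):
    # Euclidean distance between two feature vectors.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def local_similarity_weights(x, positives, negatives, k=3):
    # ASSUMPTION: a stand-in "local" rule, not the paper's definition.
    # The weight towards each class is the fraction of the k nearest
    # labelled neighbours of x that belong to that class.
    labelled = [(dist(x, p), +1) for p in positives]
    labelled += [(dist(x, n), -1) for n in negatives]
    nearest = sorted(labelled, key=lambda t: t[0])[:k]
    m_pos = sum(1 for _, y in nearest if y == +1) / len(nearest)
    return m_pos, 1.0 - m_pos

def weighted_hinge_objective(w, b, positives, negatives, ambiguous, weights, C=1.0):
    # ASSUMPTION: a linear soft-margin objective in which every ambiguous
    # example is counted as positive with weight m_pos and as negative
    # with weight m_neg. The paper's actual formulation may differ.
    def hinge(x, y):
        score = sum(wi * xi for wi, xi in zip(w, x)) + b
        return max(0.0, 1.0 - y * score)

    loss = sum(hinge(x, +1) for x in positives)
    loss += sum(hinge(x, -1) for x in negatives)
    for x, (m_pos, m_neg) in zip(ambiguous, weights):
        loss += m_pos * hinge(x, +1) + m_neg * hinge(x, -1)
    return 0.5 * sum(wi * wi for wi in w) + C * loss

# Toy data (hypothetical): labelled positives, negatives extracted from the
# unlabelled set, and two ambiguous examples left over.
positives = [(2.0, 2.0), (3.0, 2.5), (2.5, 3.0)]
negatives = [(-2.0, -2.0), (-3.0, -2.5), (-2.5, -3.0)]
ambiguous = [(1.5, 1.0), (-1.0, -1.5)]

weights = [local_similarity_weights(x, positives, negatives) for x in ambiguous]
separating = weighted_hinge_objective((1.0, 1.0), 0.0, positives, negatives, ambiguous, weights)
trivial = weighted_hinge_objective((0.0, 0.0), 0.0, positives, negatives, ambiguous, weights)
```

On this toy data the separating hyperplane scores a lower objective than the all-zero model, which is the behaviour one would expect if the weighted ambiguous examples agree with the labelled ones. In practice a solver would minimise such an objective over w and b; that step is omitted here.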
