首页> 外文会议>International joint conference on artificial intelligence;IJCAI-11 >Similarity-Based Approach for Positive and Unlabelled Learning
【24h】

Similarity-Based Approach for Positive and Unlabelled Learning

机译:基于相似度的积极和无标签学习方法

获取原文

摘要

Positive and unlabelled learning (PU learning) has been investigated to deal with the situation where only the positive examples and the unlabelled examples are available. Most of the previous works focus on identifying some negative examples from the unlabelled data, so that the supervised learning methods can be applied to build a classifier. However, for the remaining unlabelled data, which can not be explicitly identified as positive or negative (we call them ambiguous examples), they either exclude them from the training phase or simply enforce them to either class. Consequently, their performance may be constrained. This paper proposes a novel approach, called similarity-based PU learning (SPUL) method, by associating the ambiguous examples with two similarity weights, which indicate the similarity of an ambiguous example towards the positive class and the negative class, respectively. The local similarity-based and global similarity-based mechanisms are proposed to generate the similarity weights. The ambiguous examples and their similarity-weights are thereafter incorporated into an SVM-based learning phase to build a more accurate classifier. Extensive experiments on real-world datasets have shown that SPUL outperforms state-of-the-art PU learning methods.
机译:已经研究了积极的和未标记的学习(PU学习)以应对只有积极的示例和未标记的示例可用的情况。以前的大部分工作都集中于从未标记的数据中识别一些负面的例子,因此可以将监督学习方法应用于构建分类器。但是,对于无法明确标识为肯定或否定的剩余未标记数据(我们称其为模棱两可的示例),它们要么将其排除在训练阶段,要么将其强制执行为任一类。因此,它们的性能可能受到限制。本文通过将歧义示例与两个相似度权重相关联,提出了一种新方法,称为基于相似性的PU学习(SPUL)方法,这两个歧义权重分别指示歧义示例对正类别和负类别的相似性。提出了基于局部相似度和全局相似度的机制来产生相似度权重。此后,歧义示例及其相似性权重将合并到基于SVM的学习阶段,以建立更准确的分类器。在现实世界的数据集上进行的大量实验表明,SPUL的性能优于最新的PU学习方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号