Positive-Unlabeled Learning in the Face of Labeling Bias

机译：面对标签偏见的正面未标记的学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Positive-Unlabeled (PU) learning scenarios are a class of semi-supervised learning where only a fraction of the data is labeled, and all available labels are positive. The goal is to assign correct (positive and negative) labels to as much data as possible. Several important learning problems fall into the PU-learning domain, as in many cases the cost and feasibility of obtaining negative examples is prohibitive. In addition to the positive-negative disparity the overall cost of labeling these datasets typically leads to situations where the number of unlabeled examples greatly outnumbers the labeled. Accordingly, we perform several experiments, on both synthetic and real-world datasets, examining the performance of state of the art PU-learning algorithms when there is significant bias in the labeling process. We propose novel PU algorithms and demonstrate that they outperform the current state of the art on a variety of benchmarks. Lastly, we present a methodology for removing the costly parameter-tuning step in a popular PU algorithm.

机译：正面未标记的（PU）学习情景是一类半监督学习，其中只标记了一小部分数据，并且所有可用的标签都是正的。目标是将正确的（正负）标签分配给尽可能多的数据。几个重要的学习问题属于PU学习领域，如在许多情况下，获得负例的成本和可行性是令人禁止的。除了正负差异之外，标签这些数据集的总成本通常会导致未标记的例子数量大大寡不一的情况。因此，我们在合成和现实世界数据集上执行若干实验，检查标签过程中存在显着偏差时的艺术PU学习算法状态的性能。我们提出了新的PU算法，并证明了它们在各种基准上越优于现有技术的现有状态。最后，我们介绍了一种用于在流行的PU算法中删除昂贵的参数调整步骤的方法。

著录项

来源
《IEEE International Conference on Data Mining Workshops》|2015年||共7页
会议地点
作者
Noah Youngs; Dennis Shasha; Richard Bonneau;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP274.2;
关键词
Machine Learning; Positive-Unlabeled Learning; Semi-Supervised Learning;

机译：机器学习;积极解放的学习;半监督学习;

相似文献

外文文献
中文文献
专利

1. AdaSampling for Positive-Unlabeled and Label Noise Learning With Bioinformatics Applications [J] . Yang Pengyi, Ormerod John T., Liu Wei, Cybernetics, IEEE Transactions on . 2019,第5期

机译：AdaSampling用于使用生物信息学应用程序进行正无标签和标签噪声的学习
2. Information-Theoretic Representation Learning for Positive-Unlabeled Classification [J] . Tomoya Sakai, Gang Niu, Masashi Sugiyama Neural computation . 2021,第1期

机译：信息理论代表学习积极解析分类
3. Principled analytic classifier for positive-unlabeled learning via weighted integral probability metric [J] . Kwon Yongchan, Kim Wonyoung, Sugiyama Masashi, Machine Learning . 2020,第3期

机译：加权积分概率度量用于正无标签学习的原理化分析分类器
4. Positive-Unlabeled Learning in the Face of Labeling Bias [C] . Noah Youngs, Dennis Shasha, Richard Bonneau IEEE International Conference on Data Mining Workshops . 2015

机译：面对标签偏见的积极无标签学习
5. Algorithms and Approaches for Positive-Unlabeled Learning [D] . Jain, Shantanu. 2018

机译：积极无标签学习的算法和方法
6. Predicting HIV-1 Protease Cleavage Sites With Positive-Unlabeled Learning [O] . Zhenfeng Li, Lun Hu, Zehai Tang, 2021

机译：预测HIV-1蛋白酶切割位点具有积极 - 未标记的学习
7. AdaSampling for Positive-Unlabeled and Label Noise Learning With Bioinformatics Applications [O] . Pengyi Yang, John T. Ormerod, Wei Liu, 2019

机译：具有生物信息学应用的正面未标记和标签噪声学习的互动

Positive-Unlabeled Learning in the Face of Labeling Bias

摘要

著录项

相似文献

相关主题

期刊订阅