Positive-Unlabeled Learning in the Face of Labeling Bias

机译：面对标签偏见的积极无标签学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Positive-Unlabeled (PU) learning scenarios are a class of semi-supervised learning where only a fraction of the data is labeled, and all available labels are positive. The goal is to assign correct (positive and negative) labels to as much data as possible. Several important learning problems fall into the PU-learning domain, as in many cases the cost and feasibility of obtaining negative examples is prohibitive. In addition to the positive-negative disparity the overall cost of labeling these datasets typically leads to situations where the number of unlabeled examples greatly outnumbers the labeled. Accordingly, we perform several experiments, on both synthetic and real-world datasets, examining the performance of state of the art PU-learning algorithms when there is significant bias in the labeling process. We propose novel PU algorithms and demonstrate that they outperform the current state of the art on a variety of benchmarks. Lastly, we present a methodology for removing the costly parameter-tuning step in a popular PU algorithm.

机译：正面无标签（PU）学习方案是一类半监督学习，其中只有一部分数据被标记，并且所有可用标签都是正面的。目标是为尽可能多的数据分配正确的（正面和负面）标签。一些重要的学习问题属于PU学习领域，因为在许多情况下，获得负面例子的成本和可行性令人望而却步。除了正负差异之外，标记这些数据集的总成本通常会导致未标记示例的数量大大超过标记数量的情况。因此，我们在合成数据集和实际数据集上进行了多次实验，以检查在标记过程中存在明显偏差时，最新的PU学习算法的性能。我们提出了新颖的PU算法，并证明了它们在各种基准上均优于当前的最新技术。最后，我们提出了一种方法，可以消除流行的PU算法中昂贵的参数调整步骤。

著录项

来源
《IEEE International Conference on Data Mining Workshops》|2015年|639-645|共7页
会议地点
作者
Noah Youngs; Dennis Shasha; Richard Bonneau;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Machine Learning; Positive-Unlabeled Learning; Semi-Supervised Learning;

机译：机器学习;正无标签学习;半监督学习;
入库时间 2022-08-26 15:22:59

相似文献

外文文献
中文文献
专利

1. AdaSampling for Positive-Unlabeled and Label Noise Learning With Bioinformatics Applications [J] . Yang Pengyi, Ormerod John T., Liu Wei, Cybernetics, IEEE Transactions on . 2019,第5期

机译：AdaSampling用于使用生物信息学应用程序进行正无标签和标签噪声的学习
2. Information-Theoretic Representation Learning for Positive-Unlabeled Classification [J] . Tomoya Sakai, Gang Niu, Masashi Sugiyama Neural computation . 2021,第1期

机译：信息理论代表学习积极解析分类
3. Principled analytic classifier for positive-unlabeled learning via weighted integral probability metric [J] . Kwon Yongchan, Kim Wonyoung, Sugiyama Masashi, Machine Learning . 2020,第3期

机译：加权积分概率度量用于正无标签学习的原理化分析分类器
4. Positive-Unlabeled Learning in the Face of Labeling Bias [C] . Noah Youngs, Dennis Shasha, Richard Bonneau IEEE International Conference on Data Mining Workshops . 2015

机译：面对标签偏见的正面未标记的学习
5. Algorithms and Approaches for Positive-Unlabeled Learning [D] . Jain, Shantanu. 2018

机译：积极无标签学习的算法和方法
6. Predicting HIV-1 Protease Cleavage Sites With Positive-Unlabeled Learning [O] . Zhenfeng Li, Lun Hu, Zehai Tang, 2021

机译：预测HIV-1蛋白酶切割位点具有积极 - 未标记的学习
7. AdaSampling for Positive-Unlabeled and Label Noise Learning With Bioinformatics Applications [O] . Pengyi Yang, John T. Ormerod, Wei Liu, 2019

机译：具有生物信息学应用的正面未标记和标签噪声学习的互动

Positive-Unlabeled Learning in the Face of Labeling Bias

摘要

著录项

相似文献

相关主题

期刊订阅