Estimating the class prior and posterior from noisy positives and unlabeled data

机译：从嘈杂的阳性结果和未标记的数据估计课前和课后

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We develop a classification algorithm for estimating posterior distributions from positive-unlabeled data, that is robust to noise in the positive labels and effective for high-dimensional data. In recent years, several algorithms have been proposed to learn from positive-unlabeled data; however, many of these contributions remain theoretical, performing poorly on real high-dimensional data that is typically contaminated with noise. We build on this previous work to develop two practical classification algorithms that explicitly model the noise in the positive labels and utilize univariate transforms built on discriminative classifiers. We prove that these univariate transforms preserve the class prior, enabling estimation in the univariate space and avoiding kernel density estimation for high-dimensional data. The theoretical development and parametric and nonparametric algorithms proposed here constitute an important step towards wide-spread use of robust classification algorithms for positive-unlabeled data.

机译：我们开发了一种分类算法，用于根据未标记的阳性数据估计后验分布，该算法对阳性标签中的噪声具有鲁棒性，并且对高维数据有效。近年来，已经提出了几种从未标记的阳性数据中学习的算法。然而，这些贡献中有许多仍然是理论上的，在通常被噪声污染的真实高维数据上表现不佳。我们在之前的工作基础上开发了两种实用的分类算法，这些算法对正标签中的噪声进行显式建模，并利用基于判别式分类器的单变量变换。我们证明了这些单变量变换先保留了类，从而可以在单变量空间中进行估计，并且避免了对高维数据进行核密度估计。本文提出的理论发展以及参数和非参数算法构成了朝着广泛使用鲁棒分类算法处理未标记正数据的重要一步。

著录项

来源
《Annual conference on Neural Information Processing Systems》|2016年|2693-2701|共9页
会议地点
作者
Shantanu Jain; Martha White; Predrag Radivojac;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Class-prior estimation for learning from positive and unlabeled data [J] . du Plessis Marthinus C., Niu Gang, Sugiyama Masashi Machine Learning . 2017,第4期

机译：从阳性和未标记数据中学习的班级在先估计
2. Class Prior Estimation from Positive and Unlabeled Data [J] . Marthinus Christoffel DU PLESSIS, Masashi SUGIYAMA IEICE transactions on information and systems . 2014,第5期

机译：从阳性和未标记数据进行班级优先估计
3. Learning from Positive and Unlabeled Data 2: Computationally Efficient Estimation of Class Priors [J] . Marthinus Christoffel DU PLESSIS, Gang NIU, Masashi SUGIYAMA 電子情報通信学会技術研究報告. 情報論的学習理論と機械学習 . 2014,第306期

机译：从积极的和未标记的数据中学习2：班级先验的计算有效估计
4. Estimating the class prior and posterior from noisy positives and unlabeled data [C] . Shantanu Jain, Martha White, Predrag Radivojac Annual conference on Neural Information Processing Systems . 2016

机译：从嘈杂的阳性和未标记数据估算前后和后面的课程
5. On Graph Perturbation Theory and Algorithms for Scalable Mining of Noisy and Uncertain Graph Data with Knowledge Priors. [D] . Hendrix, William Thomas. 2010

机译：图扰动理论和算法用于有知识先验的噪声和不确定图数据的可伸缩挖掘。
6. Estimating classification accuracy in positive-unlabeled learning: characterization and correction strategies [O] . Rashika Ramola, Shantanu Jain, Predrag Radivojac -1

机译：估计阳性无标签学习中的分类准确性：表征和纠正策略
7. Estimating the class prior in positive and unlabeled data through decision tree induction [O] . Bekker Jessa, Davis Jesse 2018

机译：通过决策树归纳估计阳性和未标记数据中的类优先级

Estimating the class prior and posterior from noisy positives and unlabeled data

摘要

著录项

相似文献

相关主题

期刊订阅