首页> 外文会议>Annual conference on Neural Information Processing Systems >Estimating the class prior and posterior from noisy positives and unlabeled data
【24h】

Estimating the class prior and posterior from noisy positives and unlabeled data

机译:从嘈杂的阳性结果和未标记的数据估计课前和课后

获取原文

摘要

We develop a classification algorithm for estimating posterior distributions from positive-unlabeled data, that is robust to noise in the positive labels and effective for high-dimensional data. In recent years, several algorithms have been proposed to learn from positive-unlabeled data; however, many of these contributions remain theoretical, performing poorly on real high-dimensional data that is typically contaminated with noise. We build on this previous work to develop two practical classification algorithms that explicitly model the noise in the positive labels and utilize univariate transforms built on discriminative classifiers. We prove that these univariate transforms preserve the class prior, enabling estimation in the univariate space and avoiding kernel density estimation for high-dimensional data. The theoretical development and parametric and nonparametric algorithms proposed here constitute an important step towards wide-spread use of robust classification algorithms for positive-unlabeled data.
机译:我们开发了一种分类算法,用于根据未标记的阳性数据估计后验分布,该算法对阳性标签中的噪声具有鲁棒性,并且对高维数据有效。近年来,已经提出了几种从未标记的阳性数据中学习的算法。然而,这些贡献中有许多仍然是理论上的,在通常被噪声污染的真实高维数据上表现不佳。我们在之前的工作基础上开发了两种实用的分类算法,这些算法对正标签中的噪声进行显式建模,并利用基于判别式分类器的单变量变换。我们证明了这些单变量变换先保留了类,从而可以在单变量空间中进行估计,并且避免了对高维数据进行核密度估计。本文提出的理论发展以及参数和非参数算法构成了朝着广泛使用鲁棒分类算法处理未标记正数据的重要一步。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号