Estimating the class prior in positive and unlabeled data through decision tree induction


Abstract

For tasks such as medical diagnosis and knowledge base completion, a classifier may only have access to positive and unlabeled examples, where the unlabeled data consists of both positive and negative examples. Knowing the true class prior is one way to enable learning from this type of data. In this paper, we propose a simple yet effective method for estimating the class prior by estimating the probability that a positive example is selected to be labeled. Our key insight is that subdomains of the data give a lower bound on this probability. This lower bound gets closer to the real probability as the ratio of labeled examples increases. Finding such subsets can naturally be done via top-down decision tree induction. Experiments show that our method produces estimates that are as accurate as those of state-of-the-art methods while being an order of magnitude faster.
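The lower-bound idea in the abstract can be illustrated with a small sketch. This is not the authors' algorithm: it assumes that labeled positives are selected completely at random with a constant probability `c` (an assumption not stated in the abstract), it takes the candidate subdomains as given instead of finding them by tree induction, and it applies no correction for small-sample noise. The names `c`, `label_frequency_lower_bound`, and `class_prior_from_label_frequency` are hypothetical. Under that assumption, the labeled fraction in any subdomain equals `c` times the fraction of positives there, so it cannot exceed `c`, and the class prior follows as the overall labeled fraction divided by `c`.

```python
from typing import Sequence


def label_frequency_lower_bound(
    labeled: Sequence[int],
    subsets: Sequence[Sequence[int]],
    min_size: int = 1,
) -> float:
    """Return the largest labeled fraction found in any candidate subset.

    labeled[i] is 1 if example i carries a positive label, 0 if unlabeled.
    Each subset is a list of example indices (a hypothetical subdomain).
    """
    best = 0.0
    for idx in subsets:
        if len(idx) < min_size:
            continue  # skip subsets too small to be informative
        fraction = sum(labeled[i] for i in idx) / len(idx)
        best = max(best, fraction)
    return best


def class_prior_from_label_frequency(labeled: Sequence[int], c_hat: float) -> float:
    """Estimate the class prior as P(labeled) / c_hat."""
    return (sum(labeled) / len(labeled)) / c_hat


# Toy data, built by hand (not from the paper):
# examples 0-9 form a subdomain with 10 positives, 5 of them labeled;
# examples 10-19 form a subdomain with 2 positives, 1 of them labeled.
labeled = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0] + [1] + [0] * 9
subsets = [list(range(20)), list(range(10)), list(range(10, 20))]

c_hat = label_frequency_lower_bound(labeled, subsets)
alpha_hat = class_prior_from_label_frequency(labeled, c_hat)
print(c_hat, alpha_hat)
```

On this toy data the purely positive subdomain yields the bound 0.5, and the resulting prior estimate is 0.6, which matches the 12 positives out of 20 by construction. On real data the maximum over many small subsets would be biased upward, which is why a practical estimator needs some form of correction.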

Bibliographic details

Authors: Jessa Bekker; Jesse Davis
Year: 2018
Original format: PDF
Language: English

Similar literature

1. Class-prior estimation for learning from positive and unlabeled data [J]. Marthinus C. du Plessis, Gang Niu, Masashi Sugiyama. Machine Learning, 2017, no. 4.
2. Class Prior Estimation from Positive and Unlabeled Data [J]. Marthinus Christoffel du Plessis, Masashi Sugiyama. IEICE Transactions on Information and Systems, 2014, no. 5.
3. Learning from Positive and Unlabeled Data 2: Computationally Efficient Estimation of Class Priors [J]. Marthinus Christoffel du Plessis, Gang Niu, Masashi Sugiyama. 電子情報通信学会技術研究報告. 情報論的学習理論と機械学習, 2014, no. 306.
4. Estimating the Class Prior in Positive and Unlabeled Data through Decision Tree Induction [C]. Jessa Bekker, Jesse Davis. AAAI Conference on Artificial Intelligence; Innovative Applications of Artificial Intelligence Conference; Symposium on Educational Advances in Artificial Intelligence, 2018.
5. Knowledge discovery in databases with joint decision outcomes: A decision-tree induction approach [D]. Namsik Chang. 1995.
6. Estimating classification accuracy in positive-unlabeled learning: characterization and correction strategies [O]. Rashika Ramola, Shantanu Jain, Predrag Radivojac.
7. Class-prior Estimation for Learning from Positive and Unlabeled Data [O]. Marthinus C. du Plessis, Gang Niu, Masashi Sugiyama. 2016.
8. FACE-2 Data Reductions and Analyses (Prior to Disclosure of the Treatment Decisions): Part V. Satellite-Estimated Rainfall from a Geostationary Platform in FACE-2 [R]. J. G. Meitin, W. L. Woodley, C. G. Griffith. 1981.
