IEEE Statistical Signal Processing Workshop

Differential Privacy for Positive and Unlabeled Learning With Known Class Priors



Abstract

Despite the increasing attention to big data, there are several domains where labeled data is scarce or too costly to obtain. For example, in information retrieval, gene analysis, and social network analysis, only training samples from the positive class are annotated, while the remaining unlabeled training samples consist of both positive and negative samples. Such positive and unlabeled (PU) data necessitates a mechanism for learning a two-class classifier from only one-class labeled data. Moreover, because data in these domains is highly sensitive and private, preserving the privacy of training samples is essential. This paper addresses the challenge of private PU learning by designing a differentially private algorithm for positive and unlabeled data. We first propose a learning framework for the PU setting when the class prior probability is known, with a theoretical guarantee of convergence to the optimal classifier. We then propose a privacy-preserving mechanism for the designed framework, whose privacy and utility are established both theoretically and empirically.
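The abstract describes the standard setup for PU learning with a known class prior π: the classification risk is rewritten so that it can be estimated from positive and unlabeled samples alone, via the unbiased decomposition R(f) = π·R_P⁺(f) + R_U⁻(f) − π·R_P⁻(f), and training on that risk is then made private. The sketch below illustrates this idea under explicit assumptions: the risk estimator follows the classical unbiased PU form, while the noisy-gradient loop is only a simplified stand-in for the paper's actual differentially private mechanism. All function names (pu_risk, private_pu_train) and hyperparameters are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def logistic_loss(z):
    # Numerically stable log(1 + exp(-z)).
    return np.logaddexp(0.0, -z)

def pu_risk(w, X_p, X_u, prior):
    """Unbiased PU risk for a linear scorer f(x) = x . w with known
    class prior pi:  R(f) = pi*R_P^+(f) + R_U^-(f) - pi*R_P^-(f)."""
    z_p, z_u = X_p @ w, X_u @ w
    return (prior * logistic_loss(z_p).mean()      # pi * E_P[l(f(x), +1)]
            + logistic_loss(-z_u).mean()           # E_U[l(f(x), -1)]
            - prior * logistic_loss(-z_p).mean())  # -pi * E_P[l(f(x), -1)]

def pu_risk_grad(w, X_p, X_u, prior):
    """Gradient of the unbiased PU risk with respect to w."""
    z_p, z_u = X_p @ w, X_u @ w
    g_p_pos = (-sigmoid(-z_p)[:, None] * X_p).mean(axis=0)  # grad of E_P[l(f,+1)]
    g_p_neg = ( sigmoid( z_p)[:, None] * X_p).mean(axis=0)  # grad of E_P[l(f,-1)]
    g_u_neg = ( sigmoid( z_u)[:, None] * X_u).mean(axis=0)  # grad of E_U[l(f,-1)]
    return prior * g_p_pos + g_u_neg - prior * g_p_neg

def private_pu_train(X_p, X_u, prior, epochs=200, lr=0.1,
                     clip=1.0, noise_scale=1.0, rng=None):
    """Gradient descent on the PU risk with the batch gradient clipped and
    Gaussian noise added at each step. This is only a rough noisy-gradient
    sketch: a real differential-privacy guarantee needs per-example clipping
    (or an explicit sensitivity bound) and (epsilon, delta) accounting."""
    rng = np.random.default_rng(0) if rng is None else rng
    w = np.zeros(X_p.shape[1])
    for _ in range(epochs):
        g = pu_risk_grad(w, X_p, X_u, prior)
        g = g / max(1.0, np.linalg.norm(g) / clip)            # bound the step
        g = g + rng.normal(0.0, noise_scale * clip, g.shape)  # Gaussian noise
        w = w - lr * g
    return w
```

With synthetic data, one would call something like private_pu_train(X_p, X_u, prior=0.4) and classify a new point by the sign of x @ w; the known class prior enters only through the risk estimator, which is what makes learning from one-class labels possible.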
