Differential Privacy for Positive and Unlabeled Learning With Known Class Priors

机译：用已知的类前锋积极和未标记的学习的差异隐私

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Despite the increasing attention to big data, there are several domains where labeled data is scarce or too costly to obtain. For example, for data from information retrieval, gene analysis, and social network analysis, only training samples from the positive class are annotated while the remaining unlabeled training samples consist of both unlabeled positive and unlabeled negative samples. The specific positive and unlabeled (PU) data from those domains necessitates a mechanism to learn a two-class classifier from only one-class labeled data. Moreover, because data from those domains is highly sensitive and private, preserving training samples privacy is essential. This paper addresses the challenge of private PU learning by designing a differentially private algorithm for positive and unlabeled data. We first propose a learning framework for the PU setting when the class prior probability is known, with a theoretical guarantee of convergence to the optimal classifier. We then propose a privacy-preserving mechanism for the designed framework where the privacy and utility are both theoretically and empirically proved.

机译：尽管对大数据的关注越来越高，但有几个域名可以稀缺或太昂贵的数据。例如，对于来自信息检索，基因分析和社交网络分析的数据，只有来自正类的训练样本被注释，而剩余的未标记的训练样本由未标记的阳性和未标记的阴性样品组成。来自这些域的特定的正和未标记的（PU）数据需要一个机制，用于从仅从一类标记数据中学习两级分类器。此外，由于来自这些域的数据是高度敏感和私密的，因此保留培训样本隐私至关重要。本文通过设计用于正和未标记的数据的差异私有算法来解决私有PU学习的挑战。我们首先提出了当前概率的PU设置的学习框架，具有对最佳分类器的理论保证。然后，我们为设计框架提出了隐私保留机制，其中隐私和实用在理论上和经验证明。

著录项

来源
《IEEE Statistical Signal Processing Workshop》|2018年|860p|共5页
会议地点
作者
Anh T. Pham; Raviv Raich;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN911.7-53;
关键词
Signal processing algorithms; Linear programming; Training; Conferences; Signal processing;

机译：信号处理算法;线性编程;培训;会议;信号处理;

相似文献

外文文献
中文文献
专利

1. Class-prior estimation for learning from positive and unlabeled data [J] . du Plessis Marthinus C., Niu Gang, Sugiyama Masashi Machine Learning . 2017,第4期

机译：从阳性和未标记数据中学习的班级在先估计
2. Learning from Positive and Unlabeled Data 2: Computationally Efficient Estimation of Class Priors [J] . Marthinus Christoffel DU PLESSIS, Gang NIU, Masashi SUGIYAMA 電子情報通信学会技術研究報告. 情報論的学習理論と機械学習 . 2014,第306期

机译：从积极的和未标记的数据中学习2：班级先验的计算有效估计
3. Positive Unlabeled Learning Algorithm for One Class Classification of Social Text Stream with only very few Positive Training Samples [J] . Abhinandan Vishwakarma Computer Engineering and Intelligent Systems . 2015,第3期

机译：仅有很少的积极训练样本的社会文本流的一类分类的正面无标签学习算法
4. Differential Privacy for Positive and Unlabeled Learning With Known Class Priors [C] . Anh T. Pham, Raviv Raich IEEE Statistical Signal Processing Workshop . 2018

机译：具有已知课程先验的积极和无标签学习的差异隐私
5. Algorithms and Approaches for Positive-Unlabeled Learning [D] . Jain, Shantanu. 2018

机译：积极无标签学习的算法和方法
6. Estimating classification accuracy in positive-unlabeled learning: characterization and correction strategies [O] . Rashika Ramola, Shantanu Jain, Predrag Radivojac -1

机译：估计阳性无标签学习中的分类准确性：表征和纠正策略
7. Class-prior Estimation for Learning from Positive and Unlabeled Data [O] . Plessis, Marthinus C. du, Niu, Gang, Sugiyama, Masashi 2016

机译：从正数和未标记数据中学习的先验先验估计

Differential Privacy for Positive and Unlabeled Learning With Known Class Priors

摘要

著录项

相似文献

相关主题

期刊订阅