首页> 外文会议>IEEE Statistical Signal Processing Workshop >Differential Privacy for Positive and Unlabeled Learning With Known Class Priors
【24h】

Differential Privacy for Positive and Unlabeled Learning With Known Class Priors

机译:用已知的类前锋积极和未标记的学习的差异隐私

获取原文

摘要

Despite the increasing attention to big data, there are several domains where labeled data is scarce or too costly to obtain. For example, for data from information retrieval, gene analysis, and social network analysis, only training samples from the positive class are annotated while the remaining unlabeled training samples consist of both unlabeled positive and unlabeled negative samples. The specific positive and unlabeled (PU) data from those domains necessitates a mechanism to learn a two-class classifier from only one-class labeled data. Moreover, because data from those domains is highly sensitive and private, preserving training samples privacy is essential. This paper addresses the challenge of private PU learning by designing a differentially private algorithm for positive and unlabeled data. We first propose a learning framework for the PU setting when the class prior probability is known, with a theoretical guarantee of convergence to the optimal classifier. We then propose a privacy-preserving mechanism for the designed framework where the privacy and utility are both theoretically and empirically proved.
机译:尽管对大数据的关注越来越高,但有几个域名可以稀缺或太昂贵的数据。例如,对于来自信息检索,基因分析和社交网络分析的数据,只有来自正类的训练样本被注释,而剩余的未标记的训练样本由未标记的阳性和未标记的阴性样品组成。来自这些域的特定的正和未标记的(PU)数据需要一个机制,用于从仅从一类标记数据中学习两级分类器。此外,由于来自这些域的数据是高度敏感和私密的,因此保留培训样本隐私至关重要。本文通过设计用于正和未标记的数据的差异私有算法来解决私有PU学习的挑战。我们首先提出了当前概率的PU设置的学习框架,具有对最佳分类器的理论保证。然后,我们为设计框架提出了隐私保留机制,其中隐私和实用在理论上和经验证明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号