首页> 外文会议>Workshop on Privacy in Natural Language Processing >On Primes, Log-Loss Scores and (No) Privacy
【24h】

On Primes, Log-Loss Scores and (No) Privacy

机译:关于素质,日志损失分数和(否)隐私

获取原文

摘要

A common metric for assessing the performance of binary classifiers is the Log-Loss score, which is a real number indicating the cross entropy distance between the predicted distribution over the labels and the true distribution (a point distribution defined by the ground truth labels). In this paper, we show that a malicious modeler, upon obtaining access to the Log-Loss scores on its predictions, can exploit this information to infer all the ground truth labels of arbitrary test datasets with full accuracy. We provide an efficient algorithm to perform this inference. A particularly interesting application where this attack can be exploited is to breach privacy in the setting of Membership Inference Attacks. These attacks exploit the vulnerabilities of exposing models trained on customer data to queries made by an adversary. Privacy auditing tools for measuring leakage from sensitive datasets assess the total privacy leakage based on the adversary's predictions for dat-apoint membership. An instance of the proposed attack can hence, cause complete membership privacy breach, obviating any attack model training or access to side knowledge with the adversary. Moreover, our algorithm is agnostic to the model under attack and hence, enables perfect membership inference even for models that do not memorize or overfit. In particular, our observations provide insight into the extent of information leakage from statistical aggregates and how they can be exploited.
机译:用于评估二进制分类器性能的常见度量是对数丢失分数,其是指示标签上预测分布与真正分布之间的跨熵距离的实数(由地面真理标签定义的点分布)。在本文中,我们表明,在获取对其预测上的对数损耗分数的访问时,可以利用这些信息来推断任意测试数据集的所有地面真理标签,以完全准确地推断出全面的真相标签。我们提供了一种有效的算法来执行此推断。一个特别有趣的应用程序,在这种攻击可以被剥削,是在会员推论攻击中违反隐私。这些攻击利用将客户数据培训培训的型号的漏洞利用到对手制定的查询。用于测量敏感数据集的泄漏的隐私审计工具评估了基于对攻击性的Dat-Apoint成员资格的预测的总隐私泄漏。因此,拟议攻击的一个例子,导致完整的会员隐私违规,避免了任何攻击模型培训或访问对手的侧面知识。此外,我们的算法对攻击的模型不可知,因此即使对于不记忆或过度装备的模型,也能够实现完美的隶属推断。特别是,我们的观察结果提供了洞察统计汇总的信息范围以及如何利用它们的信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号