On Primes, Log-Loss Scores and (No) Privacy

机译：关于素质，日志损失分数和（否）隐私

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A common metric for assessing the performance of binary classifiers is the Log-Loss score, which is a real number indicating the cross entropy distance between the predicted distribution over the labels and the true distribution (a point distribution defined by the ground truth labels). In this paper, we show that a malicious modeler, upon obtaining access to the Log-Loss scores on its predictions, can exploit this information to infer all the ground truth labels of arbitrary test datasets with full accuracy. We provide an efficient algorithm to perform this inference. A particularly interesting application where this attack can be exploited is to breach privacy in the setting of Membership Inference Attacks. These attacks exploit the vulnerabilities of exposing models trained on customer data to queries made by an adversary. Privacy auditing tools for measuring leakage from sensitive datasets assess the total privacy leakage based on the adversary's predictions for dat-apoint membership. An instance of the proposed attack can hence, cause complete membership privacy breach, obviating any attack model training or access to side knowledge with the adversary. Moreover, our algorithm is agnostic to the model under attack and hence, enables perfect membership inference even for models that do not memorize or overfit. In particular, our observations provide insight into the extent of information leakage from statistical aggregates and how they can be exploited.

机译：用于评估二进制分类器性能的常见度量是对数丢失分数，其是指示标签上预测分布与真正分布之间的跨熵距离的实数（由地面真理标签定义的点分布）。在本文中，我们表明，在获取对其预测上的对数损耗分数的访问时，可以利用这些信息来推断任意测试数据集的所有地面真理标签，以完全准确地推断出全面的真相标签。我们提供了一种有效的算法来执行此推断。一个特别有趣的应用程序，在这种攻击可以被剥削，是在会员推论攻击中违反隐私。这些攻击利用将客户数据培训培训的型号的漏洞利用到对手制定的查询。用于测量敏感数据集的泄漏的隐私审计工具评估了基于对攻击性的Dat-Apoint成员资格的预测的总隐私泄漏。因此，拟议攻击的一个例子，导致完整的会员隐私违规，避免了任何攻击模型培训或访问对手的侧面知识。此外，我们的算法对攻击的模型不可知，因此即使对于不记忆或过度装备的模型，也能够实现完美的隶属推断。特别是，我们的观察结果提供了洞察统计汇总的信息范围以及如何利用它们的信息。

著录项

来源
《Workshop on Privacy in Natural Language Processing》|2020年|1-6|共6页
会议地点
作者
Abhinav Aggarwal; Zekun Xu; Oluwaseyi Feyisetan; Nathanael Teissier;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Exploiting cryptography for privacy-enhanced access control: A result of the PRIME Project [J] . Claudio A. Ardagna, Jan Camenisch, Markulf Kohlweiss, Journal of computer security . 2010,第1期

机译：利用加密技术增强隐私的访问控制：PRIME项目的结果
2. Legal considerations on privacy-enhancing Location Based Services using PRIME technology [J] . Eleni Kosta, Jan Zibuschka, Tobias Scherner, Computer law & security report . 2008,第2期

机译：使用PRIME技术增强隐私的基于位置的服务的法律注意事项
3. An automated model to score the privacy of unstructured information—Social media case [J] . Erfan Aghasian, Saurabh Garg, James Montgomery Computers & Security . 2020,第May期

机译：对非结构化信息的隐私进行评分的自动化模型-社交媒体案例
4. PRIVACY SCORE: MAKING PRIVACY ASPECTS OF SURVEILLANCE SYSTEMS COMPARABLE [C] . Erik Krempel, Jurgen Beyerer Security Research Conference . 2015

机译：隐私分数：使监控系统的隐私方面可比
5. A study of Altman's revised four-variable Z″-score bankruptcy prediction model as it applies to the service industry (Edward I. Altman). [D] . Hanson, Richard O. 2003

机译：对Altman修改后的四变量Z''评分破产预测模型的研究，该模型适用于服务行业（Edward I. Altman）。
6. Combining distributed regression and propensity scores: a doubly privacy-protecting analytic method for multicenter research [O] . Sengwee Toh, Robert Wellman, R Yates Coley, 2018

机译：结合分布式回归和倾向得分：用于多中心研究的双重隐私保护分析方法
7. PriMe : human-centric privacy measurement based on user preferences towards data sharing in mobile participatory sensing systems [O] . Liu R, Cao JN, VanSyckel S, 2016

机译：PriMe：以用户为中心的隐私测量，基于用户对移动参与式传感系统中数据共享的偏好

On Primes, Log-Loss Scores and (No) Privacy

摘要

著录项

相似文献

相关主题

期刊订阅