The Price of Selection in Differential Privacy

Mitali Bafna; Jonathan Ullman

首页> 外文期刊>JMLR: Workshop and Conference Proceedings >The Price of Selection in Differential Privacy

【24h】

The Price of Selection in Differential Privacy

机译：差异隐私中的选择价格

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In the differentially private top-$k$ selection problem, we are given a dataset $X ∈pmo^n imes d$, in which each row belongs to an individual and each column corresponds to some binary attribute, and our goal is to find a set of $k ?d$ columns whose means are approximately as large as possible. Differential privacy requires that our choice of these $k$ columns does not depend too much on any on individual’s dataset. This problem can be solved using the well known exponential mechanism and composition properties of differential privacy. In the high-accuracy regime, where we require the error of the selection procedure to be to be smaller than the so-called sampling error $α≈sqrtln(d)$, this procedure succeeds given a dataset of size $n ?k ln(d)$. We prove a matching lower bound, showing that a dataset of size $n ?k ln(d)$ is necessary for private top-$k$ selection in this high-accuracy regime. Our lower bound shows that selecting the $k$ largest columns requires more data than simply estimating the value of those $k$ columns, which can be done using a dataset of size just $n ?k$.

机译：在差分私有的top- $ k $选择问题中，我们得到了一个数据集$ X∈ pmo ^ n times d $，其中每一行属于一个个体，每一列对应于一个二进制属性，我们的目标是查找一组$ k？d $列，其均值应尽可能大。差异隐私要求我们对这些$ k $列的选择不取决于个人数据集。使用众所周知的指数机制和差分隐私的合成属性可以解决此问题。在高精度体制中，我们要求选择过程的误差要小于所谓的采样误差$α≈ sqrt ln（d）/ n $，该过程在给定大小数据集的情况下成功$ n？k ln（d）$。我们证明了一个匹配的下界，表明在这种高精度体制中，对于私有顶部$ k $的选择，需要大小为$ n？k ln（d）$的数据集。我们的下限显示，选择$ k $个最大的列比简单地估计这些$ k $列的值需要更多的数据，这可以使用大小仅为$ n？k $的数据集来完成。

著录项

来源
《JMLR: Workshop and Conference Proceedings》 |2017年第3期|共18页
作者
Mitali Bafna; Jonathan Ullman;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Correlated Differential Privacy: Feature Selection in Machine Learning [J] . Zhang Tao, Zhu Tianqing, Xiong Ping, IEEE transactions on industrial informatics . 2020,第3期

机译：相关差异隐私：机器学习中的特征选择
2. Differential privacy-based evaporative cooling feature selection and classification with relief-F and random forests [J] . Bioinformatics . 2017,第18期

机译：基于差动隐私的蒸发冷却特征选择和分类与救济 - F和随机森林
3. Individual Differential Privacy: A Utility-Preserving Formulation of Differential Privacy Guarantees [J] . Jordi Soria-Comas, Josep Domingo-Ferrer, David Sánchez, Information Forensics and Security, IEEE Transactions on . 2017,第6期

机译：个人差异性隐私：差异性隐私保证的实用程序
4. Privacy Bargaining with Fairness: Privacy-Price Negotiation System for Applying Differential Privacy in Data Market Environments [C] . Kangsoo Jung, Seog Park IEEE International Conference on Big Data . 2019

机译：公平地进行隐私讨价还价：在数据市场环境中应用差异隐私的隐私价格协商系统
5. Privacy-Preserving Machine Learning via Data Compression & Differential Privacy [D] . Chanyaswad, Theerachai. 2018

机译：通过数据压缩和差异隐私保护隐私的机器学习
6. Differential privacy-based evaporative cooling feature selection and classification with relief-F and random forests [O] . Trang T Le, W Kyle Simmons, Masaya Misaki, -1

机译：基于浮雕F和随机森林的基于差分隐私的蒸发冷却特征选择和分类
7. Fingerprinting Codes and the Price of Approximate Differential Privacy [O] . Mark Bun, Jonathan Ullman, Salil Vadhan 2018

机译：指纹码代码和差别差异隐私的价格

The Price of Selection in Differential Privacy

摘要

著录项

相似文献

相关主题

期刊订阅