首页> 外文会议>IEEE Annual Symposium on Foundations of Computer Science >A Multiplicative Weights Mechanism for Privacy-Preserving Data Analysis
【24h】

A Multiplicative Weights Mechanism for Privacy-Preserving Data Analysis

机译:一种乘法保存数据分析的权重机制

获取原文

摘要

We consider statistical data analysis in the interactive setting. In this setting a trusted curator maintains a database of sensitive information about individual participants, and releases privacy-preserving answers to queries as they arrive. Our primary contribution is a new differentially private multiplicative weights mechanism for answering a large number of interactive counting (or linear) queries that arrive online and may be adaptively chosen. This is the first mechanism with worst-case accuracy guarantees that can answer large numbers of interactive queries and is {em efficient} (in terms of the runtime's dependence on the data universe size). The error is asymptotically emph{optimal} in its dependence on the number of participants, and depends only logarithmically on the number of queries being answered. The running time is nearly {em linear} in the size of the data universe. As a further contribution, when we relax the utility requirement and require accuracy only for databases drawn from a rich class of databases, we obtain exponential improvements in running time. Even in this relaxed setting we continue to guarantee privacy for {em any} input database. Only the utility requirement is relaxed. Specifically, we show that when the input database is drawn from a {em smooth} distribution — a distribution that does not place too much weight on any single data item — accuracy remains as above, and the running time becomes {em poly-logarithmic} in the data universe size. The main technical contributions are the application of multiplicative weights techniques to the differential privacy setting, a new privacy analysis for the interactive setting, and a technique for reducing data dimensionality for databases drawn from smooth distributions.
机译:我们考虑互动设置中的统计数据分析。在此设置中,可信策展人维护有关各个参与者的敏感信息数据库,并在其到达时释放隐私保留答案。我们的主要贡献是一种新的差异私有乘法权重机制,用于回答在线到达的大量交互计数(或线性)查询,并且可以自适应地选择。这是第一种具有最坏情况准确度保证的机制,可以回答大量交互式查询,并且是{EM高效}(根据运行时对数据宇宙大小的依赖项而言)。错误是渐近的表明{最佳}在其对参与者的数量的依赖下,并且仅依赖于对数上回答的查询数。在数据宇宙的大小中,运行时间几乎是{EM线性}。作为进一步的贡献,当我们放宽实用性要求并要求仅针对来自丰富类数据库的数据库准确性,我们获得运行时间的指数改进。即使在这种轻松的环境中,我们也继续保证隐私为{EM任何}输入数据库。只有实用性要求放松。具体来说,我们表明,当输入数据库从{EM平滑}分布绘制时 - 在任何单个数据项上没有放置太多重量的分布 - 准确性保持在上面,并且运行时间变为{EM poly-logarithmic}在数据宇宙大小。主要技术贡献是将乘法权重技术应用于差异隐私设置,是交互式设置的新隐私分析,以及用于减少从平滑分布中汲取的数据库的数据维度的技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号