首页> 外文会议>International Conference on Data Engineering >Constrained Private Mechanisms for Count Data
【24h】

Constrained Private Mechanisms for Count Data

机译:约束数据的受限私有机制

获取原文

摘要

Concern about how to aggregate sensitive user data without compromising individual privacy is a major barrier to greater availability of data. Differential privacy has emerged as an accepted model to release sensitive information while giving a statistical guarantee for privacy. Many different algorithms are possible to address different target functions. We focus on the core problem of count queries, and seek to design mechanisms to release data associated with a group of n individuals. Prior work has focused on designing mechanisms by raw optimization of a loss function, without regard to the consequences on the results. This can leads to mechanisms with undesirable properties, such as never reporting some outputs (gaps), and overreporting others (spikes). We tame these pathological behaviors by introducing a set of desirable properties that mechanisms can obey. Any combination of these can he satisfied by solving a linear program (LP) which minimizes a cost function, with constraints enforcing the properties. We focus on a particular cost function, and provide explicit constructions that are optimal for certain combinations of properties, and show a closed form for their cost. In the end, there are only a handful of distinct optimal mechanisms to choose between: one is the well-known (truncated) geometric mechanism; the second a novel mechanism that we introduce here, and the remainder are found as the solution to particular LPs. These all avoid the bad behaviors we identify. We demonstrate in a set of experiments on real and synthetic data which is preferable in practice, for different combinations of data distributions, constraints, and privacy parameters.
机译:关注如何在不影响个人隐私的情况下聚合敏感的用户数据是更大可用性的主要障碍。差异隐私已成为释放敏感信息的接受模型,同时为隐私提供统计保障。许多不同的算法可以解决不同的目标函数。我们专注于计数查询的核心问题,并寻求设计用于释放与一组N个体相关的数据的机制。在未经结果的原始优化的原始优化的情况下,事先工作的重点是设计机制,而不考虑结果的后果。这可以导致具有不良属性的机制,例如从未报告一些输出(差距),并过度报告其他(尖峰)。我们通过引入机制可以服从的一系列理想的属性驯服这些病理行为。通过求解最小化成本函数的线性程序(LP),他可以满足这些组合,这是强制执行属性的约束。我们专注于特定的成本函数,并提供对某些属性组合的最佳结构的显式结构,并为其成本显示封闭形式。最后,只有少数不同的最佳机制可以选择:一个是众所周知的(截断的)几何机制;我们在此引入的第二种新机制,并且其余部分被发现为特定LPS的溶液。这些都避免了我们识别的不良行为。我们在一组关于实际和合成数据的实验中展示,该实际在实践中是优选的,用于不同的数据分布,约束和隐私参数的不同组合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号