【24h】

Privacy via pseudorandom sketches

机译:通过伪随机草图进行隐私保护

获取原文

摘要

Imagine a collection of individuals who each possess private data that they do not wish to share with a third party. This paper considers how individuals may represent and publish their own data so as to simultaneously preserve their privacy and to ensure that it is possible to extract large-scale statistical behavior from the original unperturbed data. Existing techniques for perturbing data are limited by the number of users required to obtain approximate answers to queries, the richness of preserved statistical behavior, the privacy guarantees given and/or the amount of data that each individual must publish.This paper introduces a new technique to describe parts of an individual's data that is based on pseudorandom sketches. The sketches guarantee that each individual's privacy is provably maintained assuming one of the strongest definitions of privacy that we are aware of: given unlimited computational power and arbitrary partial knowledge, the attacker can not learn any additional private information from the published sketches. However, sketches from multiple users that describe a subset of attributes can be used to estimate the fraction of users that satisfy any conjunction over the full set of negated or unnegated attributes provided that there are enough users. We show that the error of approximation is independent of the number of attributes involved and only depends on the number of users available. An additional benefit is that the size of the sketch is minuscule: [log log O(M)] bits, where M is the number of users. Finally, we show how sketches can be combined to answer more complex queries. An interesting property of our approach is that despite using cryptographic primitives, our privacy guarantees do not rely on any unproven cryptographic conjectures.
机译:想象一下一个个人集合,每个人拥有不希望与第三方共享的私人数据。本文考虑了个人如何表示和发布自己的数据,以同时保护其隐私并确保可以从原始的不受干扰的数据中提取大规模的统计行为。现有的扰动数据的技术受到以下限制:获得查询的近似答案所需的用户数量,保留的统计行为的丰富性,给定的隐私保证和/或每个人必须发布的数据量。本文介绍了一种新技术描述基于伪随机草图的个人数据的某些部分。这些草图保证了我们所知的最严格的隐私定义之一,可证明每个人的隐私得到了可维护的保护:给定无限的计算能力和任意的部分知识,攻击者无法从已发布的草图中获取任何其他私人信息。但是,来自多个用户的描述属性子集的草图可用于估计满足否定或未否定属性的全部条件(如果有足够的用户)的任何用户的比例。我们表明,近似误差与所涉及的属性数量无关,并且仅取决于可用用户的数量。另一个好处是,草图的大小很小:[log log O(M)]位,其中 M 是用户数。最后,我们展示了如何将草图组合起来以回答更复杂的查询。我们方法的一个有趣的特性是,尽管使用了加密原语,但我们的隐私保证并不依赖于任何未经验证的加密猜想。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号