首页> 外文学位 >Mathematical results on Database Privacy.
【24h】

Mathematical results on Database Privacy.

机译:关于数据库隐私的数学结果。

获取原文
获取原文并翻译 | 示例

摘要

The central problem in Data Privacy is how to release valuable information pertaining to a group of individuals, while preserving their privacy. A key question is how far the data disclosure can go, without compromising the privacy of the individuals who contributed their data. Examples of this include databases containing health information about patients, customer electronic transactions, and web browsing history. In this work we focus on attacks to two types of mechanisms that are meant to protect privacy: output perturbation and de-identification.;We study the Hadamard adversary in the context of differential privacy [18], and prove several new results showing the amount of error that is necessarily caused by a randomized output perturbation sanitizer providing differential privacy. We show that the required amount of noise is excessively large, rendering the sanitized responses with little or no utility. We conclude that differential privacy against the Hadamard adversary comes with an extremely large cost on the utility of the sanitized responses.;Regarding the mechanisms that anonymize databases via de-identification, we perform several experiments with linkage attacks on real data contained in the microdata file of the Joint Canada/United States Survey of Health 2004 [47]. We show that with a large (or at least significant) probability, an adversary knowing a rather small amount of auxiliary information about the less sensitive attributes of the database, can successfully link such an auxiliary information (which could be associated with an identity) to the whole and anonymous record of the corresponding individual.;Then we review the theoretical result of Narayanan and Shmatikov [38] on database de-anonymization. We start by exhibiting counterexamples to their main theoretical proof, and develop new theorems of de-anonymization that fix these problems. We also contribute more and new theoretical results that incorporate hypotheses on the sparseness of the database, and contemplate the realistic situation in which the auxiliary information of the adversary contains rare attributes, which in turn improves the de-anonymization by requiring less auxiliary information.;On output perturbation, we revisit two known attacks: the Dinur-Nissim adversary [14] and the Hadamard adversary [23]. We extend the know results on the success of the Dinur-Nissim adversary to a more general and abstract setting that includes, in particular, both real-valued and binary databases---the case studied in [14]---each case with natural and appropriate metrics. In this general setting we provide a better explanation of the relationships between all the relevant parameters of the problem, and consequently obtain more efficient and versatile results on the performance of the adversary.
机译:数据隐私的中心问题是如何在保留个人隐私的同时发布与一群人有关的有价值的信息。一个关键的问题是数据公开可以走多远而又不损害贡献其数据的个人的隐私。这样的示例包括包含有关患者的健康信息,客户电子交易和Web浏览历史的数据库。在这项工作中,我们集中于攻击旨在保护隐私的两种类型的机制:输出扰动和去身份识别。;我们在差异性隐私的背景下研究了Hadamard对手[18],并证明了一些新的结果表明错误是由提供差分隐私的随机输出扰动消毒器必然引起的。我们表明,所需的噪声量过大,使得经过消毒的响应几乎没有效用。我们得出的结论是,针对Hadamard对手的差异性隐私在清理后的响应的实用性方面付出了巨大的代价。;关于通过去标识使数据库匿名化的机制,我们对微数据文件中包含的真实数据进行了链接攻击的多次实验2004年加拿大/美国卫生联合调查[47]。我们表明,以较大(或至少显着)的概率,知道很少有关数据库较不敏感属性的辅助信息的对手可以成功地将此类辅助信息(可能与身份相关联)链接到然后,我们回顾了Narayanan和Shmatikov [38]在数据库去匿名化方面的理论结果。我们从展示其主要理论证据的反例开始,并开发解决这些问题的去匿名化新定理。我们还贡献了更多和新的理论结果,这些假设结合了关于数据库稀疏性的假设,并考虑了现实情况,即对手的辅助信息包含稀有属性,从而通过减少辅助信息来改善反匿名化。关于输出扰动,我们重新审视两种已知的攻击:Dinur-Nissim对手[14]和Hadamard对手[23]。我们将关于Dinur-Nissim对手成功的已知结果扩展到一个更通用和抽象的设置,其中特别包括实值数据库和二进制数据库(在[14]中研究的案例)-自然和适当的指标。在这种一般情况下,我们可以更好地解释问题的所有相关参数之间的关系,从而在对手的表现上获得更有效,更通用的结果。

著录项

  • 作者

    Merener, Martin Miguel.;

  • 作者单位

    York University (Canada).;

  • 授予单位 York University (Canada).;
  • 学科 Computer science.;Information technology.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 172 p.
  • 总页数 172
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号