Mathematical results on Database Privacy.

机译：关于数据库隐私的数学结果。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The central problem in Data Privacy is how to release valuable information pertaining to a group of individuals, while preserving their privacy. A key question is how far the data disclosure can go, without compromising the privacy of the individuals who contributed their data. Examples of this include databases containing health information about patients, customer electronic transactions, and web browsing history. In this work we focus on attacks to two types of mechanisms that are meant to protect privacy: output perturbation and de-identification.;We study the Hadamard adversary in the context of differential privacy [18], and prove several new results showing the amount of error that is necessarily caused by a randomized output perturbation sanitizer providing differential privacy. We show that the required amount of noise is excessively large, rendering the sanitized responses with little or no utility. We conclude that differential privacy against the Hadamard adversary comes with an extremely large cost on the utility of the sanitized responses.;Regarding the mechanisms that anonymize databases via de-identification, we perform several experiments with linkage attacks on real data contained in the microdata file of the Joint Canada/United States Survey of Health 2004 [47]. We show that with a large (or at least significant) probability, an adversary knowing a rather small amount of auxiliary information about the less sensitive attributes of the database, can successfully link such an auxiliary information (which could be associated with an identity) to the whole and anonymous record of the corresponding individual.;Then we review the theoretical result of Narayanan and Shmatikov [38] on database de-anonymization. We start by exhibiting counterexamples to their main theoretical proof, and develop new theorems of de-anonymization that fix these problems. We also contribute more and new theoretical results that incorporate hypotheses on the sparseness of the database, and contemplate the realistic situation in which the auxiliary information of the adversary contains rare attributes, which in turn improves the de-anonymization by requiring less auxiliary information.;On output perturbation, we revisit two known attacks: the Dinur-Nissim adversary [14] and the Hadamard adversary [23]. We extend the know results on the success of the Dinur-Nissim adversary to a more general and abstract setting that includes, in particular, both real-valued and binary databases---the case studied in [14]---each case with natural and appropriate metrics. In this general setting we provide a better explanation of the relationships between all the relevant parameters of the problem, and consequently obtain more efficient and versatile results on the performance of the adversary.

机译：数据隐私的中心问题是如何在保留个人隐私的同时发布与一群人有关的有价值的信息。一个关键的问题是数据公开可以走多远而又不损害贡献其数据的个人的隐私。这样的示例包括包含有关患者的健康信息，客户电子交易和Web浏览历史的数据库。在这项工作中，我们集中于攻击旨在保护隐私的两种类型的机制：输出扰动和去身份识别。；我们在差异性隐私的背景下研究了Hadamard对手[18]，并证明了一些新的结果表明错误是由提供差分隐私的随机输出扰动消毒器必然引起的。我们表明，所需的噪声量过大，使得经过消毒的响应几乎没有效用。我们得出的结论是，针对Hadamard对手的差异性隐私在清理后的响应的实用性方面付出了巨大的代价。;关于通过去标识使数据库匿名化的机制，我们对微数据文件中包含的真实数据进行了链接攻击的多次实验2004年加拿大/美国卫生联合调查[47]。我们表明，以较大（或至少显着）的概率，知道很少有关数据库较不敏感属性的辅助信息的对手可以成功地将此类辅助信息（可能与身份相关联）链接到然后，我们回顾了Narayanan和Shmatikov [38]在数据库去匿名化方面的理论结果。我们从展示其主要理论证据的反例开始，并开发解决这些问题的去匿名化新定理。我们还贡献了更多和新的理论结果，这些假设结合了关于数据库稀疏性的假设，并考虑了现实情况，即对手的辅助信息包含稀有属性，从而通过减少辅助信息来改善反匿名化。关于输出扰动，我们重新审视两种已知的攻击：Dinur-Nissim对手[14]和Hadamard对手[23]。我们将关于Dinur-Nissim对手成功的已知结果扩展到一个更通用和抽象的设置，其中特别包括实值数据库和二进制数据库（在[14]中研究的案例）-自然和适当的指标。在这种一般情况下，我们可以更好地解释问题的所有相关参数之间的关系，从而在对手的表现上获得更有效，更通用的结果。

著录项

作者
Merener, Martin Miguel.;
展开▼
作者单位

York University (Canada).;

展开▼
授予单位 York University (Canada).;
学科 Computer science.;Information technology.
学位 Ph.D.
年度 2012
页码 172 p.
总页数 172
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Mathematics in VINITI RAS: From Abstract Journal to Databases: Part 1. Development of the Abstract Journal (AJ) “Mathematics” from 1953 to 2007 [J] . I. Yu. Nikol’skaya, V. M. Yefremenkova Scientific & Technical Information Processing . 2008,第3期

机译：VINITI RAS中的数学：从抽象期刊到数据库：第1部分。1953年至2007年抽象期刊（AJ）“数学”的发展
2. 'Most Americans like their privacy.' Exploring privacy concerns through US guests' reviews [J] . DAcunto David, Volo Serena, Filieri Raffaele International Journal of Contemporary Hospitality Management . 2021,第8期

机译：“大多数美国人喜欢他们的隐私”。通过美国客人的评论探索隐私问题
3. Viewpoint Making the commitment to student data privacy. [J] . Linnette Atta T.H.E. Journal . 2017,第4期

机译：观点致力于保护学生数据的隐私。
4. Internet of Things and Blockchain: Legal Issues and Privacy. The Challenge for a Privacy Standard [C] . Nicola Fabiano 2017 IEEE International Conference on Internet of Things, IEEE Green Computing and Communications, IEEE Cyber, Physical and Social Computing, IEEE Smart Data . 2017

机译：物联网和区块链：法律问题和隐私。隐私标准的挑战
5. Cryptography for secure and private databases: Enabling practical data access without compromising privacy. [D] . Green, Matthew Daniel. 2009

机译：安全和私有数据库的密码术：在不损害隐私的情况下实现实际数据访问。
6. Vitamin D Serum Levels in the UK Population including a Mathematical Approach to Evaluate the Impact of Vitamin D Fortified Ready-to-Eat Breakfast Cereals: Application of the NDNS Database [O] . Wim Calame, Laura Street, Toine Hulshof 2020

机译：英国人口的维生素D血清水平包括一种数学方法来评估维生素D强化即食早餐谷物的影响：NDNS数据库的应用
7. Dependency Integration of Locally Independent Relational Databases into a Distributed Database (Mathematical Methods in Software Science and Engineering : Third Conference) [O] . TANAKA KATSUMI, KAMBAYASHI YAHIKO 1981

机译：本地独立关系数据库到分布式数据库的依赖集成（软件科学与工程中的数学方法：第三次会议）

Mathematical results on Database Privacy.

摘要

著录项

相似文献

相关主题

期刊订阅