首页> 外文学位 >Practical Differential Privacy for High-dimensional and Graph Data
【24h】

Practical Differential Privacy for High-dimensional and Graph Data

机译:高维和图形数据的实用差分隐私

获取原文
获取原文并翻译 | 示例

摘要

Differential privacy has emerged as a de facto standard of privacy notion. It is widely adopted in various domains, including data publishing, data mining, and interactive database queries. However, applying differential privacy on complex data still remains challenging due to the huge change of sensitivity. In this dissertation, we introduce three major topics about publishing information with high-dimensional and graph data under differential privacy. The first topic discusses the possibility of publishing column counts from high-dimensional data under differential privacy, with a proposed technique called sensitivity control. The idea is to limit the contribution of data records such that sensitivity can be limited. We solve the challenge of balancing the sensitivity level and remaining data utility. The second topic aims at solving the problem of high-dimensional data classification with differential privacy. We propose PrivWalk, a greedily walking algorithm that iteratively searches the optimal model and also automatically determines the number of steps given a privacy budget. In the third topic, we advance the technique to publish degree distribution from a graph under node-differential privacy. We develop a projection technique that preserves that most utility and also limits the sensitivity. Based on the projection method, we propose two approaches for publishing degree histograms. The experiments of the three topics demonstrate that our proposed techniques significantly improve the existing state of the art, making differential privacy on high-dimensional and graph data practical.
机译:差异隐私已成为事实上的隐私概念标准。它已在各个领域广泛采用,包括数据发布,数据挖掘和交互式数据库查询。但是,由于灵敏度的巨大变化,在复杂数据上应用差异隐私仍然具有挑战性。本文介绍了在隐私保护下发布具有高维数据和图形数据的信息的三个主要主题。第一个主题讨论了使用一种称为灵敏度控制的技术在差分隐私下发布来自高维数据的列数的可能性。这个想法是为了限制数据记录的贡献,从而可以限制敏感性。我们解决了在灵敏度水平和剩余数据实用性之间取得平衡的挑战。第二个主题旨在解决具有差异隐私的高维数据分类问题。我们提出了贪婪行走算法PrivWalk,该算法可迭代搜索最佳模型,并在给定隐私预算的情况下自动确定步骤数。在第三个主题中,我们改进了从节点差分隐私下的图发布度分布的技术。我们开发了一种投影技术,该技术可以保留大多数效用并限制灵敏度。基于投影方法,我们提出了两种发布度直方图的方法。这三个主题的实验表明,我们提出的技术显着改善了现有技术,使高维和图形数据的差分隐私实用。

著录项

  • 作者

    Day, Wei-Yen.;

  • 作者单位

    Purdue University.;

  • 授予单位 Purdue University.;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2017
  • 页码 147 p.
  • 总页数 147
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:54:25

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号