首页> 美国卫生研究院文献>other >Differentially Private Synthesization of Multi-Dimensional Data using Copula Functions
【2h】

Differentially Private Synthesization of Multi-Dimensional Data using Copula Functions

机译:使用Copula函数的多维数据的差分私有合成

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Differential privacy has recently emerged in private statistical data release as one of the strongest privacy guarantees. Most of the existing techniques that generate differentially private histograms or synthetic data only work well for single dimensional or low-dimensional histograms. They become problematic for high dimensional and large domain data due to increased perturbation error and computation complexity. In this paper, we propose DPCopula, a differentially private data synthesization technique using Copula functions for multi-dimensional data. The core of our method is to compute a differentially private copula function from which we can sample synthetic data. Copula functions are used to describe the dependence between multivariate random vectors and allow us to build the multivariate joint distribution using one-dimensional marginal distributions. We present two methods for estimating the parameters of the copula functions with differential privacy: maximum likelihood estimation and Kendall’s τ estimation. We present formal proofs for the privacy guarantee as well as the convergence property of our methods. Extensive experiments using both real datasets and synthetic datasets demonstrate that DPCopula generates highly accurate synthetic multi-dimensional data with significantly better utility than state-of-the-art techniques.
机译:最近,在私有统计数据发布中出现了差异性隐私,这是最强大的隐私保证之一。生成差分私有直方图或合成数据的大多数现有技术仅适用于一维或低维直方图。由于扰动误差和计算复杂性的增加,它们对于高维和大域数据而言成为问题。在本文中,我们提出了DPCopula,一种使用Copula函数处理多维数据的差分私有数据合成技术。我们方法的核心是计算差分私有copula函数,从中可以对合成数据进行采样。 Copula函数用于描述多元随机向量之间的依赖性,并允许我们使用一维边际分布建立多元联合分布。我们介绍了两种用于估计具有差分隐私的copula函数参数的方法:最大似然估计和Kendall的τ估计。我们提供有关隐私保证以及我们方法的收敛性的形式证明。使用真实数据集和合成数据集进行的大量实验表明,DPCopula可以生成高度精确的合成多维数据,其实用性要比最新技术好得多。

著录项

  • 期刊名称 other
  • 作者单位
  • 年(卷),期 -1(2014),-1
  • 年度 -1
  • 页码 475–486
  • 总页数 38
  • 原文格式 PDF
  • 正文语种
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号