首页> 外文会议>ACM SIGMOD international conference on management of data >Differentially Private Aggregation of Distributed Time-Series with Transformation and Encryption
【24h】

Differentially Private Aggregation of Distributed Time-Series with Transformation and Encryption

机译:具有转换和加密的分布式时间序列的差异私有聚合

获取原文

摘要

We propose PASTE, the first differentially private aggregation algorithm for distributed time-series data that offers good practical utility without any trusted server. PASTE addresses two important challenges in participatory data-mining applications where (ⅰ) individual users collect temporally correlated time-series data (such as location traces, web history, personal health data), and (ⅱ) an untrusted third-party aggregator wishes to run aggregate queries on the data. To address this, PASTE incorporates two new algorithms. To ensure differential privacy for time-series data despite the presence of temporal correlation, PASTE uses the Fourier Perturbation Algorithm (FPA_k). Standard differential privacy techniques perform poorly for time-series data. To answer n queries, such techniques can result in a noise of (☉)(n) to each query answer, making the answers practically useless if n is large. Our FPA_k algorithm perturbs the Discrete Fourier Transform of the query answers. For answering n queries, FPA_k improves the expected error from ☉(n) to roughly ☉(k) where k is the number of Fourier coefficients that can (approximately) reconstruct all the n query answers. Our experiments show that k 《 n for many real-life-data-sets resulting in a huge error-improvement for FPA_k. To deal with the absence of a trusted central server, PASTE uses the Distributed Laplace Perturbation Algorithm (DLPA) that adds noise in a distributed way in order to guarantee differential privacy. To the best of our knowledge, DLPA is the first distributed differentially private algorithm that can scale with a large number of users: DLPA outperforms the only other distributed solution for differential privacy proposed so far, by reducing the computational load per user from O(U) to O(1) where U is the number of users.
机译:我们建议锡膏,第一有差异的私人聚合算法为分布式的时间序列数据,没有任何可信的服务器提供了良好的实用性。 PASTE地址在(ⅰ)的个人用户收集时间相关的时间序列数据(如地理位置痕迹,Web历史记录,个人健康数据),和参与数据挖掘应用程序的两个重要挑战(ⅱ)不受信任的第三方信息汇总意愿运行聚集查询的数据。为了解决这个问题,PASTE集成了两个新的算法。为了确保的时间序列数据的差分隐私尽管时间相关性的存在,PASTE使用傅立叶微扰算法(FPA_k)。标准差的保密技术表现不佳的时间序列数据。要回答N次查询,这种技术可能会导致(☉)(n)的每个查询答案的噪声,使得如果答案几乎没用n很大。我们FPA_k算法扰乱离散傅立叶变换的查询答案。回答N次查询,FPA_k改善了从☉(n)的预期的误差大致☉(K),其中k是可以(大约)重建的所有n的查询答案的傅立叶系数的个数。我们的实验表明,K“n表示许多产生了巨大的错误,改进FPA_k现实生活中的数据集。为了解决缺乏信任的中央服务器,粘贴使用分布式拉普拉斯摄动法(DLPA),增加了噪声分布的方式,以保证差分隐私。据我们所知,DLPA是第一分配差异私有算法,可以用大量的用户规模:DLPA优于差分隐私的唯一其他分布式解决方案提出至今,选自O降低每用户的计算负荷(U )至O(1)其中U是用户的数目。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号