Distributed Cardinality Estimation of Set Operations with Differential Privacy

机译：具有差分隐私的集合操作的分布式基数估计

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we study the problem of estimating the cardinality of pairwise set operations (union and intersection) over sets possessed by different data owners, while preserving differential privacy. In our problem setting, a data owner could only communicate with an untrusted server, and thus have to perturb its set data for privacy protection before sharing them with the server. This problem setting is relevant to diverse applications in practice, including sensor-based traffic monitoring, cross-domain data integration, and combining findings from multiple surveys. To tackle this problem, we first adopt existing randomized response technique to perturb the bit vector (to achieve differential privacy) and develop tools which the server can use to derive the cardinality of set operations from the randomized bit vectors. However, the variance of the union/intersection estimator grows linearly with the universe (bit-vector) size which is impractical for large universes. To keep the variance low we in addition propose to resort to Bloom filters instead of high-dimensional bit vectors to share set data with the server. The key insight is that in spite of inevitable collisions in BF by keeping its size small we can bound the variance of the union/intersection cardinality estimators. Finally, we show that investing a small part of the privacy budget into reporting (obfuscated) set cardinality can further reduce the estimator errors for up to 20%. Our empirical analysis reveals the impact of various parameters including privacy budget and Bloom filter size on the overall accuracy of the approach and demonstrates the utility of the proposed solution.

机译：在本文中，我们研究了在保留差异隐私的同时，估计不同数据所有者拥有的集合上的成对集合操作（联合和交集）的基数的问题。在我们的问题设置中，数据所有者只能与不受信任的服务器通信，因此在与服务器共享数据之前，必须先扰动其设置的数据以保护隐私。此问题的设置与实践中的各种应用程序相关，包括基于传感器的流量监控，跨域数据集成以及合并来自多个调查的结果。为了解决这个问题，我们首先采用现有的随机响应技术来扰动位向量（以实现差分隐私），并开发工具，服务器可以使用该工具从随机位向量中导出集合操作的基数。但是，并集/相交估计量的方差随宇宙（位向量）的大小线性增长，这对于大宇宙是不切实际的。为了使方差保持较低，我们还建议使用Bloom过滤器代替高维位向量来与服务器共享设置数据。关键的见解是，尽管在BF中不可避免地发生碰撞，但通过保持其大小较小，我们可以限制联合/交叉点基数估计量的方差。最终，我们表明，将隐私预算的一小部分投入到报告（模糊的）集合基数上，可以进一步将估计器错误减少多达20％。我们的经验分析揭示了各种参数（包括隐私预算和布隆过滤器大小）对方法总体精度的影响，并证明了所提出解决方案的实用性。

著录项

来源
《2017 IEEE Symposium on Privacy-Aware Computing》|2017年|37-48|共12页
会议地点 Washington(US)
作者
Rade Stanojevic; Mohamed Nabeel; Ting Yu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Privacy; Servers; Data privacy; Sensors; Monitoring; Estimation; Electronic mail;

机译：隐私;服务器;数据隐私;传感器;监视;估计;电子邮件;;
入库时间 2022-08-26 13:55:15

相似文献

外文文献
中文文献
专利

1. Privacy-preserving disjunctive normal form operations on distributed sets [J] . Chun J.Y., Hong D., Jeong I.R., Information Sciences: An International Journal . 2013,第Null期

机译：隐私保留分布式集的拆除正常形式操作
2. Distributed Hash Sketches: Scalable, Efficient, And Accurate Cardinality Estimation For Distributed Multisets [J] . N. NTARMOS, P. TRIANTAFILLOU, G. WEIKUM ACM transactions on computer systems . 2009,第1期

机译：分布式哈希草图：分布式多集的可扩展，高效且准确的基数估计
3. Quantum Multiparty Privacy Set Intersection Cardinality [J] . Shi Run-Hua IEEE Transactions on Circuits and Systems. II, Express Briefs . 2021,第4期

机译：Quantum MultiParty隐私设定交叉基数
4. Distributed Cardinality Estimation of Set Operations with Differential Privacy [C] . Rade Stanojevic, Mohamed Nabeel, Ting Yu IEEE Symposium on Privacy-Aware Computing . 2017

机译：用差异隐私的分布式基数估计
5. Differentially Private Set Intersection Cardinalities in the Multiparty Setting [D] . Hadley, Connor. 2018

机译：差异私有设置多党设定的交叉线
6. Angular Rate Estimation Using a Distributed Set of Accelerometers [O] . Sungsu Park, Sung Kyung Hong 2012

机译：使用一组分布式加速度计的角速率估计
7. Distributed Set-Expression Cardinality Estimation [O] . 2008

机译：分布式集合表达基数估计

Distributed Cardinality Estimation of Set Operations with Differential Privacy

摘要

著录项

相似文献

相关主题

期刊订阅