首页> 外文会议>IEEE International Conference on Data Mining Workshops >Scalable Privacy-Preserving Linking of Multiple Databases Using Counting Bloom Filters
【24h】

Scalable Privacy-Preserving Linking of Multiple Databases Using Counting Bloom Filters

机译:使用计数布隆过滤器可扩展地保护多个数据库的隐私保护链接

获取原文

摘要

The integration, mining, and analysis of person-specific data can provide enormous opportunities for organizations, governments, and researchers to leverage today's massive data collections. However, the use of personal or otherwise sensitive data also raises concerns about the privacy, confidentiality, and potential discrimination of people. Privacy-preserving record linkage (PPRL) is a growing research area that aims at integrating sensitive information from multiple disparate databases held by different organizations while preserving the privacy of the individuals in these databases by not revealing their identities and thereby preventing re-identification and discrimination. PPRL approaches are increasingly required in many real-world application areas ranging from healthcare to national security. Previous approaches to PPRL have mostly focused on linking only two databases. Scaling PPRL to several databases is an open challenge since privacy threats as well as the computation and communication costs increase significantly with the number of databases involved. We thus propose a new encoding method of sensitive data based on Counting Bloom Filters (CBF) to improve privacy for multi-party PPRL (MP-PPRL). We investigate optimizations to reduce computation and communication costs for CBF-based MP-PPRL. Our empirical evaluation with real datasets demonstrates the viability of our approach in terms of scalability, linkage quality, and privacy.
机译:个人特定数据的集成,挖掘和分析可以为组织,政府和研究人员提供巨大的机会,以利用当今的大量数据收集。但是,使用个人数据或其他敏感数据也会引起人们对隐私,机密性和潜在歧视的担忧。隐私保护记录链接(PPRL)是一个正在发展的研究领域,旨在整合来自不同组织拥有的多个不同数据库的敏感信息,同时通过不透露其身份来保护这些数据库中个人的隐私,从而防止重新识别和歧视。从医疗保健到国家安全,许多实际应用领域都越来越需要PPRL方法。 PPRL的先前方法主要集中在仅链接两个数据库。将PPRL扩展到多个数据库是一个开放的挑战,因为隐私威胁以及计算和通信成本会随着所涉及数据库的数量而显着增加。因此,我们提出了一种基于计数布隆过滤器(CBF)的敏感数据编码新方法,以提高多方PPRL(MP-PPRL)的隐私性。我们研究优化以减少基于CBF的MP-PPRL的计算和通信成本。我们对真实数据集的实证评估证明了我们的方法在可伸缩性,链接质量和隐私方面的可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号