首页> 外文会议>Data Privacy International COnference >Parallelizing Record Linkage for Disclosure Risk Assessment
【24h】

Parallelizing Record Linkage for Disclosure Risk Assessment

机译:对披露风险评估的并行性记录连锁

获取原文

摘要

Handling very large volumes of confidential data is becoming a common practice in many organizations such as statistical agencies. This calls for the use of protection methods that have to be validated in terms of the quality they provide. With the use of Record Linkage (RL) it is possible to compute the disclosure risk, which gives a measure of the quality of a data protection method. However, the RL methods proposed in the literature are computationally costly, which poses difficulties when frequent RL processes have to be executed on large data. Here, we propose a distributed computing technique to improve the performance of a RL process. We show that our technique not only improves the computing time of a RL process significantly, but it is also scalable in a distributed environment. Also, we show that distributed computation can be complemented with SMP based parallelization in each node increasing the final speedup.
机译:处理非常大量的机密数据正在成为统计机构等许多组织的常见做法。这需要使用必须在其提供的质量方面验证的保护方法。通过使用记录链接(RL),可以计算披露风险,这给出了数据保护方法的质量的量度。然而,文献中提出的RL方法是计算昂贵的,当必须在大数据上执行频繁的RL过程时造成困难。在这里,我们提出了一种分布式计算技术来提高R1过程的性能。我们表明我们的技术不仅可以显着提高RL过程的计算时间,而且还可以在分布式环境中扩展。此外,我们表明,分布式计算可以在每个节点中增加基于SMP的并行化,增加最终加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号