...
首页> 外文期刊>Journal of supercomputing >Distributed L-diversity using spark-based algorithm for large resource description frameworks data
【24h】

Distributed L-diversity using spark-based algorithm for large resource description frameworks data

机译:使用基于火花的大资源描述框架数据分布式L-多样性

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Privacy protection issues for resource description frameworks (RDFs) have emerged over the use of public government open data and the healthcare data of individuals. As these data may include personal information, they must undergo a de-identification process that deletes or replaces parts of the original data. To enable these protections, a method has been developed to apply k-anonymization to RDF data. However, sensitive RDF information anonymized using k-anonymization is not completely secure and is vulnerable to attacks. In this paper, we propose an l-diversity anatomy de-identification method that can overcome the limitations of k-anonymity and guarantee stronger privacy protection than k-anonymization. Further, as this data anonymization process is computationally time-intensive, we use Spark distributed computing to provide rapid de-identification to enhance its utility. We also propose l-diversity preservation for dynamically evolving RDF datasets. Experimental results show that our proposed distributed l-diversity algorithm processes the data more efficiently than conventional approaches.
机译:资源描述框架(RDFS)的隐私保护问题已经过度使用公共政府开放数据和个人的医疗保健数据。由于这些数据可以包括个人信息,因此它们必须经历删除或替换部分原始数据的去识别过程。为了实现这些保护,已经开发了一种方法来将K-匿名化应用于RDF数据。但是,使用k-匿名化匿名的敏感的RDF信息并不完全安全,并且易于攻击。在本文中,我们提出了一个L-多样性解剖解析方法,可以克服K-Anonyment的局限性,并保证比K-onynamamization更强的隐私保护。此外,随着该数据的匿名过程是计算时间密集的,我们使用Spark分布式计算来提供快速的去识别,以增强其实用程序。我们还提出了L-多样性保存,以便动态地发展RDF数据集。实验结果表明,我们提出的分布式L-多样性算法比传统方法更有效地处理数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号