...
首页> 外文期刊>Information Sciences: An International Journal >Privacy preserving similarity joins using MapReduce
【24h】

Privacy preserving similarity joins using MapReduce

机译:隐私保留相似性使用MapReduce加入

获取原文
获取原文并翻译 | 示例

摘要

Similarity join is an essential operator in data processing, mining and analysis. However, it is resource intensive and time consuming, particularly when processing big data. There is also a need to ensure data confidentiality in similarity joins, as joining between two files may result in personal information disclosure. Based on these two considerations, this paper proposes a MapReduce-based similarity joins with differential privacy technology (hereafter, referred to as PSJoin). The proposed parallel algorithm is designed to achieve high efficiency, in terms of answering similarity join queries privately and effectively. Specifically, the use of PSJoin ensures the preservation of privacy during the similarity join process and in the published results. A new private global ordering approach is presented in this paper, which is designed to deal with potential disclosure issues during the process, and a differential private similarity function is provided for this algorithm. Findings from our evaluations using large-scale real-world datasets demonstrate that our method can effectively guarantee privacy with only minimal accuracy loss in similarity queries, while offering good scalability consistently. (C) 2019 Elsevier Inc. All rights reserved.
机译:相似之处是数据处理,挖掘和分析中的重要操作员。但是,它是资源密集和耗时的,特别是在处理大数据时。还需要确保在相似性连接中的数据机密性,因为两个文件之间的连接可能导致个人信息披露。基于这两种考虑,本文提出了一种基于MapReduce的相似性与差异隐私技术(以下,称为Psjoin)。拟议的并行算法旨在实现高效率,就私下和有效地应答相似之处。具体而言,使用PSJOIN确保在相似性加入过程和已发布结果中保存隐私。本文提出了一种新的全局全球订购方法,该方法旨在在此过程中处理潜在的披露问题,并且为该算法提供了差异私有相似功能。我们使用大规模现实世界数据集的评估结果表明,我们的方法可以有效地保证隐私,只能在相似性查询中的最小精度损失,同时始终如一地提供良好的可扩展性。 (c)2019 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号