首页> 外文会议>International Conference on Computer Engineering Systems >A clustering approach for anonymizing distributed data streams
【24h】

A clustering approach for anonymizing distributed data streams

机译:一种匿名分布式数据流匿名的聚类方法

获取原文

摘要

Privacy preserving data mining have been studied widely on static data. Static algorithms are not suitable for streaming data. This imposes the study of new algorithms for privacy preserving that cope with data streams characteristics. Recently, effective anonymization algorithms have been studied on centralized data streams. In this paper we propose an approach for anonymizing distributed data streams based on clustering. First, anonymization is performed locally at each site by clustering a single stream, then local clusters are exchanged between sites through a global server to construct global clusters. The algorithm is shown to be effective when compared to a centralized algorithm and to the case where no communication is exchanged between sites. In addition, empirical results on real and synthetic data sets have shown that the proposed algorithm gives better information loss when compared to the without communication case and close results to the centralized case. Moreover, the algorithm is shown to be efficient in terms of communication and scalable with increasing number of sites.
机译:隐私保留数据挖掘已经在静态数据上广泛研究。静态算法不适合流数据。这强加了对隐私保护的新算法研究,该算法应对数据流特征。最近,已经在集中数据流上研究了有效的匿名化算法。在本文中,我们提出了一种基于群集匿名的分布式数据流匿名的方法。首先,通过群集单个流在每个站点处本地执行匿名化,然后通过全局服务器在站点之间交换本地群集以构建全局集群。与集中算法和在站点之间不交换通信的情况相比,该算法显示有效。此外,与无需通信案例相比,所提出的算法对实验和合成数据集的实证结果表明,当没有通信案例并将结果接近集中式情况时,该算法提供更好的信息丢失。此外,该算法在通信方面被证明是有效的,并且随着网站数量的越来越多地缩放。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号