【24h】

Privacy Preserving Clustering

机译:隐私保护群集

获取原文
获取原文并翻译 | 示例

摘要

The freedom and transparency of information flow on the Internet has heightened concerns of privacy. Given a set of data items, clustering algorithms group similar items together. Clustering has many applications, such as customer-behavior analysis, targeted marketing, forensics, and bioinformatics. In this paper, we present the design and analysis of a privacy-preserving k-means clustering algorithm, where only the cluster means at the various steps of the algorithm are revealed to the participating parties. The crucial step in our privacy-preserving k-means is privacy-preserving computation of cluster means. We present two protocols (one based on oblivious polynomial evaluation and the second based on homomorphic encryption) for privacy-preserving computation of cluster means. We have a JAVA implementation of our algorithm. Using our implementation, we have performed a thorough evaluation of our privacy-preserving clustering algorithm on three data sets. Our evaluation demonstrates that privacy-preserving clustering is feasible, i.e., our homomorphic-encryption based algorithm finished clustering a large data set in approximately 66 seconds.
机译:Internet上信息流的自由和透明引起了人们对隐私的关注。给定一组数据项,聚类算法将相似的项组合在一起。群集具有许多应用程序,例如客户行为分析,定向营销,取证和生物信息学。在本文中,我们介绍了一种保护隐私的k均值聚类算法的设计和分析,该算法仅将算法各个步骤中的聚类方法透露给与会各方。我们的隐私保护k均值中的关键步骤是聚类平均数的隐私保护计算。我们提出两种协议(一种基于遗忘多项式求值,另一种基于同态加密),用于聚类平均值的隐私保护计算。我们有我们算法的JAVA实现。使用我们的实现,我们对三个数据集对我们的隐私保护聚类算法进行了全面评估。我们的评估表明,保护隐私的聚类是可行的,即我们基于同态加密的算法在大约66秒内完成了对大型数据集的聚类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号