首页> 外文会议>4th ACM symposium on information, computer and communications security 2009 >The Union-Split Algorithm and Cluster-Based Anonymization of Social Networks
【24h】

The Union-Split Algorithm and Cluster-Based Anonymization of Social Networks

机译:社交网络的联合拆分算法和基于集群的匿名化

获取原文
获取原文并翻译 | 示例

摘要

Knowledge discovery on social network data can uncover latent social trends and produce valuable findings that benefit the welfare of the general public. A growing amount of research finds that social networks play a surprisingly powerful role in people's behaviors. Before the social network data can be released for research purposes, the data needs to be anonymized to prevent potential re-identification attacks. Most of the existing anonymization approaches were developed for relational data, and cannot be used to handle social network data directly.rnIn this paper, we model social networks as undirected graphs and formally define privacy models, attack models for the anonymization problem, in particular an i-hop degree-based anonymization problem, i.e., the adversary's prior knowledge includes the target's degree and the degrees of neighbors within i hops from the target. We present two new and efficient clustering methods for undirected graphs: bounded t-means clustering and union-split clustering algorithms that group similar graph nodes into clusters with a minimum size constraint. These clustering algorithms are contributions beyond the specific social network problems studied and can be used to cluster general data types besides graph vertices. We also develop a simple-yet-effective inter-cluster matching method for anonymizing social networks by strategically adding and removing edges based on nodes' social roles. We carry out a series of experiments to evaluate the graph utilities of the anonymized social networks produced by our algorithms.
机译:关于社交网络数据的知识发现可以发现潜在的社会趋势并产生有价值的发现,从而有益于广大公众。越来越多的研究发现,社交网络在人们的行为中起着惊人的强大作用。在出于研究目的发布社交网络数据之前,需要对数据进行匿名处理以防止潜在的重新标识攻击。现有的大多数匿名方法都是为关系数据开发的,不能直接用于处理社交网络数据。在本文中,我们将社交网络建模为无向图,并正式定义隐私模型,针对匿名问题的攻击模型,尤其是基于i跳程度的匿名化问题,即对手的先验知识包括目标的程度和距目标i跳内的邻居的程度。我们为无向图提供了两种新的高效聚类方法:有界t均值聚类和联合拆分聚类算法,这些算法将相似的图节点分组为具有最小大小约束的聚类。这些聚类算法的作用超出了研究的特定社交网络问题,可用于聚类图顶点以外的常规数据类型。我们还开发了一种简单有效的集群间匹配方法,该方法通过根据节点的社会角色策略性地添加和删除边缘来匿名化社交网络。我们进行了一系列实验,以评估由我们的算法产生的匿名社交网络的图效用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号