...
首页> 外文期刊>Expert Systems with Application >A data driven anonymization system for information rich online social network graphs
【24h】

A data driven anonymization system for information rich online social network graphs

机译:用于信息丰富的在线社交网络图的数据驱动匿名系统

获取原文
获取原文并翻译 | 示例

摘要

In recent years, online social networks have become a part of everyday life for millions of individuals. Also, data analysts have found a fertile field for analyzing user behavior at individual and collective levels, for academic and commercial reasons. On the other hand, there are many risks for user privacy, as information a user may wish to remain private becomes evident upon analysis. However, when data is anonymized to make it safe for publication in the public domain, information is inevitably lost with respect to the original version, a significant aspect of social networks being the local neighborhood of a user and its associated data. Current anonymization techniques are good at identifying risks and minimizing them, but not so good at maintaining local contextual data which relate users in a social network. Thus, improving this aspect will have a high impact on the data utility of anonymized social networks. Also, there is a lack of systems which facilitate the work of a data analyst in anonymizing this type of data structures and performing empirical experiments in a controlled manner on different datasets. Hence, in the present work we address these issues by designing and implementing a sophisticated synthetic data generator together with an anonymization processor with strict privacy guarantees and which takes into account the local neighborhood when anonymizing. All this is done for a complex dataset which can be fitted to a real dataset in terms of data profiles and distributions. In the empirical section we perform experiments to demonstrate the scalability of the method and the improvement in terms of reduction of information loss with respect to approaches which do not consider the local neighborhood context when anonymizing. (C) 2016 Elsevier Ltd. All rights reserved.
机译:近年来,在线社交网络已成为数百万个人日常生活的一部分。此外,出于学术和商业原因,数据分析师还发现了一个肥沃的领域,可以用于分析个人和集体级别的用户行为。另一方面,用户隐私存在许多风险,因为分析后用户可能希望保持隐私的信息变得明显。但是,当对数据进行匿名处理以使其可以安全地在公共领域中发布时,相对于原始版本,信息不可避免地会丢失,社交网络的重要方面是用户的本地邻居及其相关数据。当前的匿名化技术擅于识别风险并将其最小化,但并不擅长维护与社交网络中的用户相关的本地上下文数据。因此,改进这一方面将对匿名社交网络的数据实用性产生重大影响。而且,缺乏一种系统,该系统可便利数据分析人员匿名处理这种类型的数据结构并以可控方式对不同数据集执行经验性实验。因此,在当前工作中,我们通过设计和实现复杂的合成数据生成器以及具有严格隐私保护的匿名处理程序来解决这些问题,并且匿名处理时会考虑本地邻居。所有这些都是针对复杂的数据集完成的,该数据集可以根据数据配置文件和分布拟合到实际的数据集。在经验部分,我们进行实验以证明该方法的可扩展性以及相对于匿名时不考虑本地邻域上下文的方法而言,在减少信息丢失方面的改进。 (C)2016 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号