【24h】

Anonymization in the Time of Big Data

机译:大数据时代的匿名化

获取原文

摘要

In this work we explore how viable is anonymization to prevent disclosure in structured big data. For the sake of concreteness, we focus on fc-anonymity, which is the best-known privacy model based on anonymization. We identify two main challenges to use fc-anonymity in big data. First, confidential attributes can also be quasi-identifier attributes, which increases the number of quasi-identifier attributes and may lead to a large information loss to attain fc-anonymity. Second, in big data there is an unlimited number of data controllers, who may publish independent fc-anonymous releases on overlapping populations of subjects; the fc-anonymity guarantee does not longer hold if an observer pools such independent releases. We propose solutions to deal with the above two challenges. Our conclusion is that, with the proposed adjustments, fc-anonymity is still useful in a context of big data.
机译:在这项工作中,我们探索匿名化如何防止在结构化大数据中泄露。为了具体起见,我们将重点放在fc-anonymity上,它是基于匿名化的最著名的隐私模型。我们确定了在大数据中使用fc-匿名性的两个主要挑战。首先,机密属性也可以是准标识符属性,这增加了准标识符属性的数量,并可能导致大量信息丢失,从而导致fc匿名。其次,在大数据中,可以使用无限数量的数据控制者,他们可以在重叠的主题人群中发布独立的fc-anonymous版本。如果观察者汇集了这样的独立发行版,则fc-anonymity保证将不再成立。我们提出解决上述两个挑战的解决方案。我们的结论是,通过建议的调整,fc-匿名在大数据环境中仍然有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号