首页> 外文会议>13th international conference on extending database technology 2010 >The Hardness and Approximation Algorithms for L-Diversity
【24h】

The Hardness and Approximation Algorithms for L-Diversity

机译:L-多样性的硬度和近似算法

获取原文
获取原文并翻译 | 示例

摘要

The existing solutions to privacy preserving publication can be classified into the theoretical and heuristic categories. The former guarantees provably low information loss, whereas the latter incurs gigantic loss in the worst case, but is shown empirically to perform well on many real inputs. While numerous heuristic algorithms have been developed to satisfy advanced privacy principles such as l-diversity, t-closeness, etc., the theoretical category is currently limited to k-anonymity which is the earliest principle known to have severe vulnerability to privacy attacks. Motivated by this, we present the first theoretical study on l-diversity, a popular principle that is widely adopted in the literature. First, we show that optimal l-diverse generalization is NP-hard even when there are only 3 distinct sensitive values in the microdata. Then, an (l · d)-approximation algorithm is developed, where d is the dimensionality of the underlying dataset. This is the first known algorithm with a non-trivial bound on information loss. Extensive experiments with real datasets validate the effectiveness and efficiency of proposed solution.
机译:现有的隐私保护发布解决方案可以分为理论和启发式两类。前者保证了极低的信息损失,而后者在最坏的情况下却招致巨大的损失,但凭经验证明在许多实际输入上表现良好。尽管已经开发了许多启发式算法来满足高级隐私原则,例如l-多样性,t紧密度等,但理论类别目前仅限于k-匿名性,这是已知对隐私攻击具有严重脆弱性的最早原则。因此,我们提出了关于l-多样性的第一个理论研究,l-多样性是一种在文献中被广泛采用的流行原理。首先,我们证明即使在微数据中只有3个不同的敏感值时,最优的l多元泛化也是NP难的。然后,开发了一种(l·d)近似算法,其中d是基础数据集的维数。这是第一个已知的对信息丢失具有非平凡限制的算法。使用真实数据集进行的大量实验验证了所提出解决方案的有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号