首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Minimum spanning tree partitioning algorithm for microaggregation
【24h】

Minimum spanning tree partitioning algorithm for microaggregation

机译:用于最小聚合的最小生成树划分算法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a clustering algorithm for partitioning a minimum spanning tree with a constraint on minimum group size. The problem is motivated by microaggregation, a disclosure limitation technique in which similar records are aggregated into groups containing a minimum of k records. Heuristic clustering methods are needed since the minimum information loss microaggregation problem is NP-hard. Our MST partitioning algorithm for microaggregation is sufficiently efficient to be practical for large data sets and yields results that are comparable to the best available heuristic methods for microaggregation. For data that contain pronounced clustering effects, our method results in significantly lower information loss. Our algorithm is general enough to accommodate different measures of information loss and can be used for other clustering applications that have a constraint on minimum group size.
机译:本文提出了一种聚类算法,用于对最小生成树进行分区,并限制最小组大小。该问题是由微聚合引起的,微聚合是一种公开限制技术,其中类似的记录被聚合到包含最少k个记录的组中。由于最小的信息丢失微聚集问题是NP难的,因此需要启发式聚类方法。我们的用于微聚合的MST分区算法足够有效,可以用于大型数据集,并且所产生的结果可与用于微聚合的最佳启发式方法相媲美。对于包含明显聚类效果的数据,我们的方法可显着降低信息丢失。我们的算法足够通用,可以适应不同程度的信息丢失,并且可以用于对最小组大小有限制的其他群集应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号