【24h】

Adaptive Cluster Ensemble Selection

机译:自适应集群集合选择

获取原文

摘要

Cluster ensembles generate a large number of different clustering solutions and combine them into a more robust and accurate consensus clustering. On forming the ensembles, the literature has suggested that higher diversity among ensemble members produces higher performance gain. In contrast, some studies also indicated that medium diversity leads to the best performing ensembles. Such contradicting observations suggest that different data, with varying characteristics, may require different treatments. We empirically investigate this issue by examining the behavior of cluster ensembles on benchmark data sets. This leads to a novel framework that selects ensemble members for each data set based on its own characteristics. Our framework first generates a diverse set of solutions and combines them into a consensus partition P~*. Based on the diversity between the ensemble members and P~*, a subset of ensemble members is selected and combined to obtain the final output. We evaluate the proposed method on benchmark data sets and the results show that the proposed method can significantly improve the clustering performance, often by a substantial margin. In some cases, we were able to produce final solutions that significantly outperform even the best ensemble members.
机译:Cluster Ensembles生成大量不同的聚类解决方案,并将它们组合成更强大,更准确的共识群集。在形成集合的情况下,文献表明,集合成员之间的多样性产生了更高的性能增益。相比之下,一些研究还表明媒体多样性导致最好的执行集合。这种矛盾的观察表明,具有不同特征的不同数据可能需要不同的治疗方法。我们通过检查基准数据集中的集群集合的行为来凭证调查此问题。这导致了一种新颖的框架,其基于其自身的特性为每个数据集选择集合成员。我们的框架首先生成多样化的解决方案,并将它们结合到共识分区P〜*。基于集合成员和P〜*之间的多样性,选择并组合集合成员的子集以获得最终输出。我们评估基准数据集的提出方法,结果表明,该方法可以显着提高聚类性能,通常通过大幅度的余量。在某些情况下,我们能够生产最终解决方案,即使是最好的合奏成员也要显着优于优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号