首页> 外文会议>Pacific Rim International Conference on Artificial Intelligence >An Adaptive Clustering Algorithm by Finding Density Peaks
【24h】

An Adaptive Clustering Algorithm by Finding Density Peaks

机译:通过查找密度峰值的自适应聚类算法

获取原文

摘要

Clustering by fast search and find of density peaks (shorted as DPC) is a powerful clustering algorithm. However it has a fatal problem that once a point is assigned erroneously, then there may be many more points will be assigned to error clusters. Furthermore, its density peaks need to be selected manually, so as to the clustering may be poor. Lastly it cannot find density peaks from sparse cluster when the data set comprises dense and sparse clusters simultaneously. This paper proposed a new clustering algorithm to overcome the aforementioned weaknesses of DPC by adaptively finding density peaks and assigning points to their most proper clusters. The new density ρ_i of point i was defined. The adjusting strategy for γ_i, and the assignment strategy for remaining points, and the merging strategy for erroneously partitioned clusters were proposed. Many challengeable synthetic datasets were used to test the power of the proposed algorithm. The experimental results demonstrate that the proposed algorithm can correctly detect clusters with any arbitrary shapes. Its performance is superior to DPC and its variants in terms of bench mark metrics, such as clustering accuracy (Acc), adjusted mutual information (AMI) and adjusted rand index (ARI).
机译:通过快速搜索和查找密度峰值(短路为DPC)是一种强大的聚类算法。然而,它具有一个致命的问题,就错误地分配了一个点,那么可能会将更多的点分配给错误群集。此外,需要手动选择其密度峰,以便聚类可能差。最后,当数据集同时包括密集和稀疏簇时,它无法从稀疏群集找到密度峰值。本文提出了一种新的聚类算法,通过自适应地发现密度峰值和将点分配给最适当的集群来克服DPC的上述弱点。定义了点I的新密度ρi。提出了γ_I的调整策略,以及剩余点的分配策略,以及错误分区集群的合并策略。许多可取的合成数据集用于测试所提出的算法的功率。实验结果表明,所提出的算法可以正确地检测具有任何任意形状的簇。其性能优于DPC及其在替补标准度量方面的变体,例如聚类精度(ACC),调整的互信息(AMI)和调整的RAND指数(ARI)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号