首页> 外文期刊>Knowledge-Based Systems >Dense members of local cores-based density peaks clustering algorithm
【24h】

Dense members of local cores-based density peaks clustering algorithm

机译:基于局部核的密集成员的密度峰聚类算法

获取原文
获取原文并翻译 | 示例
           

摘要

An efficient clustering algorithm by fast search and find of density peaks (DP) was proposed and attracted much attention from researchers. It assumes that cluster centers are surrounded by lower density points and have a larger distance from points with higher densities. According to the characteristic of cluster centers, we can easily obtain centers from decision graph. However, DP algorithm fails to cluster manifold data sets, especially when there are a lot of noises in the manifold data sets. In this paper, we propose a dense members of local core-based density peaks clustering algorithm DLORE-DP. First, we find local cores to represent the data set. After that, only dense members of local cores are taken into consideration when computing the graph distance between local cores, avoiding the interference of noises. Then, natural neighbor-based density and the new defined graph distance are used to construct decision graph on local cores and DP algorithm is employed to cluster local cores. Finally, we assign each remaining point to the cluster its representative belongs to. The new defined graph distance helps our algorithm cluster manifold data sets and the elimination of low density points makes it more robust. Moreover, since we only calculate the graph distance between local cores, instead of all pairs of points, it greatly reduces the running time. The experimental results on synthetic and real data sets show that DLORE-DP is more effective, efficient and robust than other algorithms when clustering manifold data sets with noises. (c) 2020 Elsevier B.V. All rights reserved.
机译:提出了一种通过快速搜索和发现密度峰(DP)的有效聚类算法,并引起了研究人员的广泛关注。假定聚类中心被较低密度的点包围,并且与较高密度的点之间的距离较大。根据聚类中心的特点,我们可以很容易地从决策图中获得中心。但是,DP算法无法对流形数据集进行聚类,尤其是在流形数据集中存在大量噪声的情况下。在本文中,我们提出了一种基于局部核心的密集成员密度峰值聚类算法DLORE-DP。首先,我们找到代表数据集的局部核。此后,在计算局部核心之间的图形距离时,仅考虑局部核心的密集成员,从而避免了噪声的干扰。然后,使用基于自然邻居的密度和新定义的图距离在局部核心上构建决策图,并采用DP算法对局部核心进行聚类。最后,我们将每个剩余点分配给代表其所属的集群。新定义的图形距离可帮助我们的算法对流形数据集进行聚类,而低密度点的消除使其更加健壮。而且,由于我们仅计算局部核心之间的图形距离,而不是所有成对的点,因此大大减少了运行时间。在综合和真实数据集上的实验结果表明,当用噪声对流形数据集进行聚类时,DLORE-DP比其他算法更有效,高效和健壮。 (c)2020 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号