首页> 外文会议>International Conference on Pattern Recognition and Image Analysis >A Graph-based Density Peaks Method by Employing Shortest Path for Data Clustering
【24h】

A Graph-based Density Peaks Method by Employing Shortest Path for Data Clustering

机译:基于图形的密度峰值方法通过采用数据群集的最短路径

获取原文

摘要

Data clustering is one of the most important and fundamental tasks of machine learning. Data clustering aims at dividing a set of objects into several groups according to their similarities. In recent years Density Peaks Clustering (DPC) was introduced as a fast and non-iterative clustering method which does not require any previous knowledge about the number of clusters. However, this method suffers from a few shortcomings such as its sensitivity to the user-adjustable parameter, disability to consider data distribution, and inappropriate center selection when facing complex clusters. To overcome these issues, in this paper, a novel density-based peaks clustering method called GDPCS is proposed. By employing the properties of the mutual neighborhood graph and shortest path distance, the proposed method considers the data distribution, present a better shape of clusters, and reduces the clusters' connectivity. To demonstrate the proposed method's effectiveness and superiority, many experiments were performed on both real-world and synthetic datasets. The obtained results show that the proposed method has achieved an acceptable result on imbalanced and complex shaped clusters and can detect more appropriate centers.
机译:数据聚类是机器学习最重要和最基本的任务之一。数据群集旨在根据其相似之处将一组对象划分为几个组。近年来,密度峰值聚类(DPC)被引入为快速和非迭代聚类方法,不需要对群集数量的任何知识。然而,这种方法遭受了一些缺点,例如对用户可调参数的敏感性,禁用数据分布,并且在面对复杂的集群时选择不适当的中心选择。为了克服这些问题,提出了一种名为GDPC的新型基于密度的峰集聚类方法。通过采用相邻邻域图和最短路径距离的特性,所提出的方法认为数据分布,呈现更好的簇形状,并降低了集群的连接。为了展示所提出的方法的有效性和优越性,许多实验是对现实世界和合成数据集进行的。所得结果表明,该方法已经在不平衡和复杂的形状簇上实现了可接受的结果,并且可以检测更合适的中心。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号