An efficient grid-based clustering method by finding density peaks

机译：通过找到密度峰值的有效的基于网格的聚类方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Clustering or categorizing an unprocessed data set is essential and critical in many areas. Much success has been published, which first needs to calculate the mutual distances between data points. It suffers from considerable computational costs, preventing the state-of-the-art methods such as the clustering method by fast search and find of density peaks (FSFDP, published in Science, 2014) from applying into real life (e.g., with thousands of data points). In this paper, an efficient grid-based clustering (GBC) method by finding density peaks is described. It keeps the advantage of the friendly interactive interface in the FSFDP, at the mean time, decreases enormously the computation complexity. The time complexity of the FSFDP is o(np(np - 1)/2) while our method decreases it to o(np * size of (grid)), where np is the number of data points and the size of grid is always much smaller than np so that the time complexity of our approach is almost linearly proportional to np. The presented GBC method by finding density peaks was able to calculate the densities and categorize datasets within much less time, which makes the density-peak-based algorithm practical. By using the presented algorithm, it was possible to cluster high-dimensional data sets as well. The GBC method by finding density peaks was successfully verified in clustering several datasets, which are commonly used to test clustering algorithms in published articles. It turned out that the presented method is much faster and efficient in clustering datasets into different categories than the conventional density-based ones, which makes the proposed method more preferable.

机译：在许多领域中，对未处理的数据集进行聚类或分类至关重要。已经取得了很多成功，这首先需要计算数据点之间的相互距离。它遭受了可观的计算成本，阻止了诸如通过快速搜索和发现密度峰的聚类方法（FSFDP，Science，2014年）之类的最新方法应用于现实生活中（例如，成千上万个数据点）。在本文中，描述了一种通过找到密度峰值的有效基于网格的聚类（GBC）方法。它保留了FSFDP中友好的交互式界面的优势，与此同时，大大降低了计算复杂度。 FSFDP的时间复杂度为o（np（np-1）/ 2），而我们的方法将其降低为o（np *（grid）的大小），其中np是数据点的数量，网格的大小始终是比np小得多，因此我们方法的时间复杂度几乎与np成线性比例关系。提出的通过发现密度峰值的GBC方法能够在更短的时间内计算出密度并对数据集进行分类，这使基于密度峰的算法变得实用。通过使用提出的算法，也可以对高维数据集进行聚类。通过查找密度峰值的GBC方法已成功地聚类了几个数据集，这些数据集通常用于测试已发表文章中的聚类算法。结果表明，与传统的基于密度的方法相比，该方法在将数据集聚类到不同类别中方面要更快，更有效，这使得该方法更为可取。

著录项

来源
《Annual Conference of the IEEE Industrial Electronics Society》|2016年|837-842|共6页
会议地点
作者
Bo Wu; B. M. Wilamowski;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Clustering algorithms; Clustering methods; Standards; Time complexity; Algorithm design and analysis; Shape;

机译：聚类算法;聚类方法;标准;时间复杂度;算法设计与分析;形状;

相似文献

外文文献
中文文献
专利

1. Secure grid-based density peaks clustering on hybrid cloud for industrial IoT [J] . Sun Liping, Ci Shang, Liu Xiaoqing, International Journal of Network Management . 2021,第2期

机译：基于牢固的基于网格的密度峰集聚类在工业物联网上的混合云上
2. An improved density peaks clustering algorithm with fast finding cluster centers [J] . Xiao Xu, Shifei Ding, Zhongzhi Shi Knowledge-Based Systems . 2018,第OCTa15期

机译：一种具有快速发现聚类中心的改进的密度峰聚类算法
3. Adaptive Partitioning by Local Density-Peaks: An Efficient Density-Based Clustering Algorithm for Analyzing Molecular Dynamics Trajectories [J] . Liu Song, Zhu Lizhe, Sheong Fu Kit, Journal of Computational Chemistry: Organic, Inorganic, Physical, Biological . 2017,第3a4期

机译：通过局部密度峰值的自适应分区：一种高效的基于密度的聚类算法，用于分析分子动力学轨迹
4. An efficient grid-based clustering method by finding density peaks [C] . Bo Wu, B. M. Wilamowski Annual Conference of the IEEE Industrial Electronics Society . 2016

机译：通过找到密度峰值的基于基于网格的聚类方法
5. Efficient grid-based techniques for density functional theory . [D] . Rodriguez-Hernandez, Juan Ignacio. 2008

机译：高效的基于网格的密度泛函理论技术。
6. flowPeaks: a fast unsupervised clustering for flow cytometry data via K-means and density peak finding [O] . Yongchao Ge, Stuart C. Sealfon -1

机译：flowPeaks：通过K均值和密度峰发现对流式细胞术数据进行快速无监督的聚类
7. Efficient Clustering Method Based on Density Peaks With Symmetric Neighborhood Relationship [O] . Chunrong Wu, Jia Lee, Teijiro Isokawa, 2019

机译：基于密度峰值与对称邻域关系的有效聚类方法

An efficient grid-based clustering method by finding density peaks

摘要

著录项

相似文献

相关主题

期刊订阅