A fast clustering algorithm based on grid and density

Abstract

The efficiency of data mining algorithms becomes increasingly important as data sets grow larger. Density-based clustering analysis can discover clusters of arbitrary shape and is insensitive to noise. The advantage of grid-based clustering methods is their linear time complexity. In this paper, we present CLUGD, a new clustering algorithm based on grid and density. We first partition the relevant portion of the data space into a grid. The algorithm then finds references by grid and classifies them into core references and bound references. Next, it attaches the data of the bound references to the nearest core references and aggregates the core references in neighboring portions. Finally, an undirected graph is used to classify these core references, and the resulting clusters are mapped back to the original data. We performed an experimental evaluation of the effectiveness and efficiency of CLUGD using synthetic data and the data of the SEQUOIA 2000 Benchmark. Both theoretical analysis and experimental results confirm that CLUGD can discover clusters of arbitrary shape and is insensitive to noise. Moreover, it runs much more efficiently than the R*-tree-based DBSCAN algorithm.
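
The abstract gives only an outline of the method, so the following is a minimal Python sketch of generic grid-plus-density clustering, not the authors' CLUGD procedure. It assumes that a grid cell is treated as "core" when it holds at least `min_pts` points, that sparse cells attach to an adjacent core cell (the first one in sorted order, rather than a true nearest-reference search), and that clusters are the connected components of a graph over adjacent core cells. The names `grid_density_cluster`, `cell_size`, and `min_pts` are hypothetical and do not come from the paper.

```python
from collections import defaultdict, deque
from itertools import product


def grid_density_cluster(points, cell_size, min_pts):
    """Return one cluster label per point; -1 marks noise (illustrative sketch)."""
    # 1. Bin every point index into the grid cell that contains it.
    cells = defaultdict(list)
    for idx, p in enumerate(points):
        cells[tuple(int(c // cell_size) for c in p)].append(idx)

    # 2. Assumption: a cell with at least min_pts points counts as "core".
    core = {c for c, members in cells.items() if len(members) >= min_pts}

    # Offsets to all neighboring cells (every direction except "stay put").
    dim = len(points[0]) if points else 0
    offsets = [o for o in product((-1, 0, 1), repeat=dim) if any(o)]

    def neighbors(cell):
        for o in offsets:
            yield tuple(a + b for a, b in zip(cell, o))

    # 3. Connected components of the graph whose edges join adjacent core cells.
    cell_label = {}
    next_label = 0
    for start in sorted(core):
        if start in cell_label:
            continue
        cell_label[start] = next_label
        queue = deque([start])
        while queue:
            cur = queue.popleft()
            for nb in neighbors(cur):
                if nb in core and nb not in cell_label:
                    cell_label[nb] = next_label
                    queue.append(nb)
        next_label += 1

    # 4. Map cell labels back to points. A sparse cell borrows the label of
    #    an adjacent core cell if one exists; otherwise its points are noise.
    labels = [-1] * len(points)
    for cell, members in cells.items():
        label = cell_label.get(cell)
        if label is None:
            adjacent = sorted(nb for nb in neighbors(cell) if nb in core)
            label = cell_label[adjacent[0]] if adjacent else -1
        for idx in members:
            labels[idx] = label
    return labels
```

Calling `grid_density_cluster(points, cell_size=1.0, min_pts=3)` on a list of coordinate tuples returns one integer label per point. Each point is binned once and each cell inspects a fixed number of neighbors, so for a fixed dimension the work grows roughly linearly with the number of points, which is the property the abstract attributes to grid-based methods.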

Bibliographic record

Source
Electrical and Computer Engineering, 2005. Canadian Conference on, pp. 2276-2279 (4 pages)
Authors
Zhiwei Sun; Zheng Zhao; Hongmei Wang; Maode Ma; Lianfang Zhang; Yantai Shu
Original format: PDF

Similar literature

1. Fast and stable clustering analysis based on Grid-mapping K-means algorithm and new clustering validity index [J]. Zhu Erzhou, Zhang Yuanxiang, Wen Peng. Neurocomputing. 2019, issue Octa21
2. GRIDEN: An effective grid-based and density-based spatial clustering algorithm to support parallel computing [J]. Deng Chao, Song Jinwei, Sun Ruizhi. Pattern Recognition Letters. 2018, issue JULa15
3. Gridwave: a grid-based clustering algorithm for market transaction data based on spatial-temporal density-waves and synchronization [J]. Chao Deng, Jinwei Song, Ruizhi Sun. Multimedia Tools and Applications. 2018, issue 22
4. A Grid and Density Based Fast Spatial Clustering Algorithm [C]. Huang Ming, Bian Fuling. International Conference on Artificial Intelligence and Computational Intelligence; AICI '09. 2009
5. On density-based and representative-based spatial clustering algorithms [D]. Chen, Chun-Sheng. 2011
6. Fast Nonparametric Density-Based Clustering of Large Data Sets Using a Stochastic Approximation Mean-Shift Algorithm [O]. Ollivier Hyrien, Andrea Baran
7. A study of density-grid based clustering algorithms on data streams [O]. Amini A., Saybani M.R., Sahaf Yazdi S.R.A. 2011
