首页> 外文期刊>Informatica: An International Journal of Computing and Informatics >AMF-IDBSCAN: Incremental Density Based Clustering Algorithm Using Adaptive Median Filtering Technique
【24h】

AMF-IDBSCAN: Incremental Density Based Clustering Algorithm Using Adaptive Median Filtering Technique

机译:AMF-IDBSCAN:使用自适应中值滤波技术的基于增量密度的聚类算法

获取原文
       

摘要

Density-based spatial clustering of applications with noise (DBSCAN) is a fundament algorithm for density-based clustering. It can discover clusters of arbitrary shapes and sizes from a large amount of data, which is containing noise and outliers. However, it fails to treat large datasets, to attend to outperforming when new data objects are inserted into the existing database, to remove totally a noise points and outliers and to handle the local density variation that exists within the cluster. So, a good clustering method should allow a significant density modification within the cluster and should learn a dynamics and large databases. In this paper, an enhancement of the DBSCAN algorithm is proposed based on incremental clustering called AMF-IDBSCAN which builds incrementally the clusters of different shapes and sizes in large datasets and eliminates the presence of noise and outliers. The proposed AMF-IDBSCAN algorithm uses a canopy clustering algorithm to pre-clustering datasets to decrease the volume of data, applies an incremental DBSCAN for clustering the data points and Adaptive Median Filtering (AMF) technique for post-clustering to reduce the number of outliers by replacing noises by chosen medians. Experimental results are obtained from the University California Irvine (UCI) repository UCI data sets. The final results show that our algorithm get good results with respect to the famous DBSCAN, IDBSCAN, and DMDBSCAN?.
机译:具有噪声的应用程序的基于密度的空间聚类(DBSCAN)是基于密度的聚类的基本算法。它可以从大量包含噪声和异常值的数据中发现任意形状和大小的簇。但是,它无法处理大型数据集,无法在将新数据对象插入现有数据库中时表现出色,无法完全消除噪声点和异常值,也无法处理集群中存在的局部密度变化。因此,一种好的聚类方法应允许在聚类中进行重大的密度修改,并应学习动力学和大型数据库。本文提出了一种基于增量聚类(称为AMF-IDBSCAN)的DBSCAN算法的增强功​​能,该聚类可在大型数据集中逐步构建不同形状和大小的聚类,并消除了噪声和异常值。提出的AMF-IDBSCAN算法使用冠层聚类算法对数据集进行预聚类以减少数据量,应用增量DBSCAN聚类数据点,并采用自适应中值滤波(AMF)技术进行聚类,以减少离群数通过用选定的中位数代替噪声。实验结果是从加州大学尔湾分校(UCI)存储库UCI数据集获得的。最终结果表明,相对于著名的DBSCAN,IDBSCAN和DMDBSCAN ?,我们的算法获得了良好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号