首页> 中文期刊> 《计算机工程》 >基于扩展网格和密度的数据流聚类算法

基于扩展网格和密度的数据流聚类算法

         

摘要

With regard to the previous phenomenon,when traditional clustering algorithms compute grid density without considering the surrounding space which leads to the unsmoothed clustering,this paper presents a data stream clustering algorithm based on extensible grid and density. By dynamically determining the grid expansion area, the algorithm reasonably expands the calculation range of grid density from this grid to the adjacent ones, and then according to the cohesion degree which is introduced from algorithm to measure the impact of surrounding data on grid density. In order to further outline the distribution of the clustering edges, the algorithm uses the boundary threshold value method which separates the boundary points from the noise. Furthermore, the algorithm puts forward an improved grid combining method which is on the basis of the judgment of inter-cluster connectivity to simplify the combination of grid clusters,and this effectively reduces the execution time of the algorithm. Experimental results show that the algorithm has higher clustering quality and efficiency.%针对现有聚类算法在计算网格密度时未考虑周围空间的影响因素而导致聚类边界不平滑的现象,提出一种基于扩展网格和密度的数据流聚类算法。通过动态确定网格扩展区域,将网格密度计算范围从本网格合理地扩展到相邻网格空间,进而根据算法中引入的凝聚度衡量周围空间数据点对网格密度的影响。为进一步精确聚类边缘的轮廓分布情况,使用边界点距离阈值函数从噪声中分离出类的边界点,并给出一种改进的网格合并方法,根据簇间连通性简化网格簇合并的判断条件,有效减少算法执行时间。实验结果表明,该算法具有较高的聚类质量和聚类效率。

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号