首页> 外文会议>IEEE Annual Computer Software and Applications Conference >A Local Cores-Based Hierarchical Clustering Algorithm for Data Sets with Complex Structures
【24h】

A Local Cores-Based Hierarchical Clustering Algorithm for Data Sets with Complex Structures

机译:复杂结构数据集的基于局部核的分层聚类算法

获取原文

摘要

Hierarchical clustering is of great importance in data analysis. Although there are a number of hierarchical clustering algorithms including agglomerative methods, divisive methods and hybrid methods, most of them are sensitive to noise points, suffer from high computational cost and cannot effectively discover clusters with complex structures. When recognizing patterns from complex structures, humans intuitively tend to discover obvious clusters in dense regions firstly and then deal with objects on the border. Inspired by this idea, we propose a local cores-based hierarchical clustering algorithm called HCLORE. The proposed method first partitions the data set into several clusters by finding local cores, instead of optimizing an objective function through iteration like K-means; then, temporarily removes points with lower local density, so that the boundary between clusters is clearer; after that, merges clusters according to a new defined similarities between clusters; and finally, points with lower local density are assigned to the same clusters as their local cores belong to. The experimental results on synthetic data sets and real data sets show that our algorithm is more effective and efficient than existing methods when processing data sets with complex structures.
机译:层次聚类在数据分析中非常重要。尽管有许多分层聚类算法,包括凝聚方法,分裂方法和混合方法,但是它们大多数对噪声点敏感,计算成本高并且不能有效地发现具有复杂结构的聚类。当从复杂的结构中识别出图案时,人类直观地倾向于先在密集的区域中发现明显的簇,然后再处理边界上的物体。受此想法启发,我们提出了一种称为HCLORE的基于局部内核的分层聚类算法。所提出的方法首先通过找到局部核心将数据集划分为几个集群,而不是像K-means这样的迭代来优化目标函数。然后,临时去除局部密度较低的点,使聚类之间的边界更清晰。之后,根据新定义的聚类之间的相似性合并聚类;最后,将具有较低局部密度的点分配给与其局部核心所属的相同簇。综合数据集和真实数据集的实验结果表明,当处理具有复杂结构的数据集时,我们的算法比现有方法更加有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号