首页> 外文会议>Web Information System and Application Conference >A Hierarchical Clustering for Categorical Data Based on Holo-Entropy
【24h】

A Hierarchical Clustering for Categorical Data Based on Holo-Entropy

机译:基于全息熵的分类数据分层聚类

获取原文

摘要

High dimensional data clustering is a difficult task in clustering analysis. Subspace clustering is an effective approach. The principle of subspace clustering is to maximize the retention of the original data information while searching for the minimal size of subspace for cluster representation. Based on information entropy and Holo-entropy, we propose an adaptive high dimensional weighted subspace clustering algorithm. The algorithm employs information entropy to extract the feature subspace, uses class compactness which binding Holo-entropy with weight in subspace for sub-clusters merging instead of the traditional similarity measurement method, and it selects the most compacted two sub-clusters to merge to achieve the maximum degree clustering effect. The algorithm is tested on nine UCI dataset, and compared with other algorithms. Our algorithm is better in both efficiency and accuracy than the other existing algorithms and has high reproducibility.
机译:高维数据聚类是聚类分析中的一项艰巨任务。子空间聚类是一种有效的方法。子空间聚类的原理是在搜索子空间的最小大小以进行聚类表示时,最大程度地保留原始数据信息。基于信息熵和全息熵,提出了一种自适应的高维加权子空间聚类算法。该算法利用信息熵提取特征子空间,利用类紧密性将子空间中的Holo熵与权重结合起来进行子聚类的合并,代替了传统的相似性度量方法,并选择压缩最紧凑的两个子聚类进行合并以实现最大程度的聚类效果。该算法在9个UCI数据集上进行了测试,并与其他算法进行了比较。我们的算法在效率和准确性上都优于其他现有算法,并且具有很高的可重复性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号