...
首页> 外文期刊>IAENG Internaitonal journal of computer science >Rough K-modes Clustering Algorithm Based on Entropy
【24h】

Rough K-modes Clustering Algorithm Based on Entropy

机译:基于熵的粗糙K模式聚类算法

获取原文
获取原文并翻译 | 示例
           

摘要

Cluster analysis is an important technique used in data mining. Categorical data clustering has received a great deal of attention in recent years. Some existing algorithms for clustering categorical data do not consider the importance of attributes for clustering, thereby reducing the efficiency of clustering analysis and limiting its application. In this paper, we propose a novel rough k-modes clustering algorithm based on entropy. First, we integrated the knowledge of information entropy to define a new dissimilarity measure that takes into account the importance of attributes for clustering and improves the quality of clustering. Then, applying the theory of rough set analysis, we used upper and lower approximation to deal with uncertain clusters, which allowed us to offer an improved solution for uncertainty analysis. Finally, our experimental results demonstrated that our proposed algorithm performed better than other conventional clustering algorithms in terms of clustering accuracy, purity, and F1-measure.
机译:聚类分析是数据挖掘中使用的一项重要技术。近年来,分类数据聚类得到了广泛的关注。现有的一些用于对分类数据进行聚类的算法并未考虑属性对聚类的重要性,从而降低了聚类分析的效率并限制了其应用。在本文中,我们提出了一种新的基于熵的粗糙k模式聚类算法。首先,我们整合了信息熵的知识,以定义一种新的差异度量,该度量考虑了聚类属性的重要性并提高了聚类的质量。然后,应用粗糙集分析的理论,我们使用上下近似来处理不确定性聚类,这为我们提供了一种改进的不确定性分析解决方案。最后,我们的实验结果表明,我们提出的算法在聚类精度,纯度和F1度量方面表现优于其他常规聚类算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号