...
首页> 外文期刊>Artificial Intelligence Review: An International Science and Engineering Journal >Diversity based cluster weighting in cluster ensemble: an information theory approach
【24h】

Diversity based cluster weighting in cluster ensemble: an information theory approach

机译:集群集合中基于多样性的群集加权:信息理论方法

获取原文
获取原文并翻译 | 示例

摘要

Clustering ensemble has been increasingly popular in the recent years by consolidating several base clustering methods into a probably better and more robust one. However, cluster dependability has been ignored in the majority of the presented clustering ensemble methods that exposes them to the risk of the low-quality base clustering methods (and consequently the low-quality base clusters). In spite of some attempts made to evaluate the clustering methods, it seems that they consider each base clustering individually regardless of the diversity. In this study, a new clustering ensemble approach has been proposed using a weighting strategy. The paper has presented a method for performing consensus clustering by exploiting the cluster uncertainty concept. Indeed, each cluster has a contribution weight computed based on its undependability. All of the predicted cluster tags available in the ensemble are used to evaluate a cluster undependability based on an information theoretic measure. The paper has proposed two measures based on cluster undependability or uncertainty to estimate the cluster dependability or certainty. The multiple clusters are reconciled through the cluster uncertainty. A clustering ensemble paradigm has been proposed through the proposed method. The paper has proposed two approaches to achieve this goal: a cluster-wise weighted evidence accumulation and a cluster-wise weighted graph partitioning. The former approach is based on hierarchical agglomerative clustering and co-association matrices, while the latter is based on bi-partite graph formulating and partitioning. In the first step of the former, the cluster-wise weighing co-association matrix is proposed for representing a clustering ensemble. The proposed approaches have been then evaluated on 19 real-life datasets. The experimental evaluation has revealed that the proposed methods have better performances than the competing methods; i.e. through the extensive experiments on the real-world datasets, it has been concluded that the proposed method outperforms the state-of-the-art. The substantial experiments on some benchmark data sets indicate that the proposed methods can effectively capture the implicit relationship among the objects with higher clustering accuracy, stability, and robustness compared to a large number of the state-of-the-art techniques, supported by statistical analysis.
机译:近年来,群集集群越来越受欢迎,通过将几种基础聚类方法巩固到可能更好,更强大的内容。但是,在大多数所呈现的聚类集群方法中忽略了集群可靠性,该方法将它们暴露于低质量基础聚类方法的风险(因此,因此低质量的基本集群)。尽管有一些尝试评估聚类方法,但似乎他们认为无论多样性如何单独考虑每个基本聚类。在本研究中,已经使用加权策略提出了一种新的聚类集群方法。本文提出了一种通过利用集群不确定性概念来执行共识群集的方法。实际上,每个群集基于其不依赖性计算的贡献权重。合奏中可用的所有预测的集群标签用于根据信息理论测量评估群集不依赖性。本文提出了两种措施,基于集群不可依赖性或不确定性来估计集群可靠性或确定性。多个集群通过群集不确定性协调。通过提出的方法提出了聚类集群范式。本文提出了两种方法来实现这一目标:群集加权证据积累和集群 - 明智加权图分区。前一种方法是基于分层凝聚聚类和共关联矩阵,而后者基于双辅导图制定和分配。在前者的第一步中,提出了组合称重共协会矩阵,用于表示集群集群。然后在19个现实生活数据集中评估了所提出的方法。实验评价显示,所提出的方法具有比竞争方法更好的表现;即,通过对现实世界数据集的广泛实验,已经得出结论,该方法优于最先进的方法。一些基准数据集的实质实验表明,与大量最先进的技术相比,所提出的方法可以有效地捕获具有更高聚类精度,稳定性和鲁棒性的对象之间的隐式关系。分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号