Diversity based cluster weighting in cluster ensemble: an information theory approach

Rashidi Frouzan; Nejatian Samad; Parvin Hamid; Rezaie Vahideh

首页> 外文期刊>Artificial Intelligence Review: An International Science and Engineering Journal >Diversity based cluster weighting in cluster ensemble: an information theory approach

【24h】

Diversity based cluster weighting in cluster ensemble: an information theory approach

机译：集群集合中基于多样性的群集加权：信息理论方法

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Clustering ensemble has been increasingly popular in the recent years by consolidating several base clustering methods into a probably better and more robust one. However, cluster dependability has been ignored in the majority of the presented clustering ensemble methods that exposes them to the risk of the low-quality base clustering methods (and consequently the low-quality base clusters). In spite of some attempts made to evaluate the clustering methods, it seems that they consider each base clustering individually regardless of the diversity. In this study, a new clustering ensemble approach has been proposed using a weighting strategy. The paper has presented a method for performing consensus clustering by exploiting the cluster uncertainty concept. Indeed, each cluster has a contribution weight computed based on its undependability. All of the predicted cluster tags available in the ensemble are used to evaluate a cluster undependability based on an information theoretic measure. The paper has proposed two measures based on cluster undependability or uncertainty to estimate the cluster dependability or certainty. The multiple clusters are reconciled through the cluster uncertainty. A clustering ensemble paradigm has been proposed through the proposed method. The paper has proposed two approaches to achieve this goal: a cluster-wise weighted evidence accumulation and a cluster-wise weighted graph partitioning. The former approach is based on hierarchical agglomerative clustering and co-association matrices, while the latter is based on bi-partite graph formulating and partitioning. In the first step of the former, the cluster-wise weighing co-association matrix is proposed for representing a clustering ensemble. The proposed approaches have been then evaluated on 19 real-life datasets. The experimental evaluation has revealed that the proposed methods have better performances than the competing methods; i.e. through the extensive experiments on the real-world datasets, it has been concluded that the proposed method outperforms the state-of-the-art. The substantial experiments on some benchmark data sets indicate that the proposed methods can effectively capture the implicit relationship among the objects with higher clustering accuracy, stability, and robustness compared to a large number of the state-of-the-art techniques, supported by statistical analysis.

机译：近年来，群集集群越来越受欢迎，通过将几种基础聚类方法巩固到可能更好，更强大的内容。但是，在大多数所呈现的聚类集群方法中忽略了集群可靠性，该方法将它们暴露于低质量基础聚类方法的风险（因此，因此低质量的基本集群）。尽管有一些尝试评估聚类方法，但似乎他们认为无论多样性如何单独考虑每个基本聚类。在本研究中，已经使用加权策略提出了一种新的聚类集群方法。本文提出了一种通过利用集群不确定性概念来执行共识群集的方法。实际上，每个群集基于其不依赖性计算的贡献权重。合奏中可用的所有预测的集群标签用于根据信息理论测量评估群集不依赖性。本文提出了两种措施，基于集群不可依赖性或不确定性来估计集群可靠性或确定性。多个集群通过群集不确定性协调。通过提出的方法提出了聚类集群范式。本文提出了两种方法来实现这一目标：群集加权证据积累和集群 - 明智加权图分区。前一种方法是基于分层凝聚聚类和共关联矩阵，而后者基于双辅导图制定和分配。在前者的第一步中，提出了组合称重共协会矩阵，用于表示集群集群。然后在19个现实生活数据集中评估了所提出的方法。实验评价显示，所提出的方法具有比竞争方法更好的表现;即，通过对现实世界数据集的广泛实验，已经得出结论，该方法优于最先进的方法。一些基准数据集的实质实验表明，与大量最先进的技术相比，所提出的方法可以有效地捕获具有更高聚类精度，稳定性和鲁棒性的对象之间的隐式关系。分析。

著录项

来源
《Artificial Intelligence Review: An International Science and Engineering Journal 》 |2019年第2期| 共28页
作者
Rashidi Frouzan; Nejatian Samad; Parvin Hamid; Rezaie Vahideh;
展开▼
作者单位

Islamic Azad Univ Yasooj Branch Dept Comp Engn Yasuj Iran;

Islamic Azad Univ Yasooj Branch Young Researchers &

Elite Club Yasuj Iran;

Islamic Azad Univ Nourabad Mamasani Branch Dept Comp Engn Nourabad Mamasani Iran;

Islamic Azad Univ Yasooj Branch Young Researchers &

Elite Club Yasuj Iran;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类人工智能理论 ;
关键词
Clustering; Clustering ensemble; Consensus function; Cluster dependability; Cluster weighting;

机译：群集;聚类合奏;共识功能;群集可靠性;群集加权;

相似文献

外文文献
中文文献
专利

1. Diversity based cluster weighting in cluster ensemble: an information theory approach [J] . Rashidi Frouzan, Nejatian Samad, Parvin Hamid, Artificial Intelligence Review: An International Science and Engineering Journal . 2019 ,第2期

机译：集群集合中基于多样性的群集加权：信息理论方法
2. Dependability‐based cluster weighting in clustering ensemble [J] . Fatemeh Najafi, Hamid Parvin, Kamal Mirzaie, Statistical Analysis and Data Mining . 2020 ,第2期

机译：集群集群中基于可靠性的群集加权
3. Dependability-based cluster weighting in clustering ensemble [J] . Industrial and organizational psychology . 2020 ,第2期

机译：集群集群基于可靠性的群集加权
4. Diversity-based Weighting Schemes for Clustering Ensembles [C] . Francesco Gullo, Andrea Tagarelli, Sergio Greco SIAM International Conference on Data Mining . 2009

机译：集群集群基于分集的权重方案
5. System for persona ensemble clustering: A cluster ensemble approach to persona development. [D] . Brickey, Jonalan. 2010

机译：角色整合聚类系统：一种用于角色发展的群集整合方法。
6. Enhancing web search result clustering model based on multiview multirepresentation consensus cluster ensemble (mmcc) approach [O] . Ali Sabah, Sabrina Tiun, Nor Samsiah Sani, 2021

机译：基于MultiView多重特派复断的共识群集（MMCC）方法增强基于MultiView Multimirepration的群集群集模型
7. Diversity-based Weighting Schemes for Clustering Ensembles [O] . Francesco Gullo, Andrea Tagarelli, Sergio Greco 2012

机译：基于多样性的聚类加权方案

Diversity based cluster weighting in cluster ensemble: an information theory approach

摘要

著录项

相似文献

相关主题

期刊订阅