Incremental Cluster Validity Indices for Online Learning of Hard Partitions: Extensions and Comparative Study

Brito Da Silva Leonardo Enzo; Melton Niklas Max; Wunsch Donald C.

首页> 外文期刊>Quality Control, Transactions >Incremental Cluster Validity Indices for Online Learning of Hard Partitions: Extensions and Comparative Study

【24h】

Incremental Cluster Validity Indices for Online Learning of Hard Partitions: Extensions and Comparative Study

机译：用于在线学习的增量群体有效性指数：扩展和比较研究

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Validation is one of the most important aspects of clustering, particularly when the user is designing a trustworthy or explainable system. However, most clustering validation approaches require batch calculation. This is an important gap because of the value of clustering in real-time data streaming and other online learning applications. Therefore, interest has grown in providing online alternatives for validation. This paper extends the incremental cluster validity index (iCVI) family by presenting incremental versions of Calinski-Harabasz (iCH), Pakhira-Bandyopadhyay-Maulik (iPBM), WB index (iWB), Silhouette (iSIL), Negentropy Increment (iNI), Representative Cross Information Potential (irCIP), Representative Cross Entropy (irH), and Conn & x005F;Index (iConn & x005F;Index). This paper also provides a thorough comparative study of correct, under- and over-partitioning on the behavior of these iCVIs, the Partition Separation (PS) index as well as four recently introduced iCVIs: incremental Xie-Beni (iXB), incremental Davies-Bouldin (iDB), and incremental generalized Dunn & x2019;s indices 43 and 53 (iGD43 and iGD53). Experiments were carried out using a framework that was designed to be as agnostic as possible to the clustering algorithms. The results on synthetic benchmark data sets showed that while evidence of most under-partitioning cases could be inferred from the behaviors of the majority of these iCVIs, over-partitioning was found to be a more challenging problem, detected by fewer of them. Interestingly, over-partitioning, rather then under-partitioning, was more prominently detected on the real-world data experiments within this study. The expansion of iCVIs provides significant novel opportunities for assessing and interpreting the results of unsupervised lifelong learning in real-time, wherein samples cannot be reprocessed due to memory and/or application constraints.

机译：验证是聚类最重要的方面之一，特别是当用户设计值得信赖或可解释的系统时。但是，大多数聚类验证方法都需要批量计算。这是一个重要的缺口，因为在实时数据流和其他在线学习应用程序中的聚类价值。因此，利息已经在提供验证的在线替代方面。本文通过呈现Calinski-Harabasz（ICH），Pakhira-Bandyopadhyay-Maulik（IPBM），WB指数（IWB），剪影（ISIL），上对应增量（INI）来扩展增量群集有效性指数（ICVI）系列系列系列代表性交叉信息潜力（IRCIP），代表性交叉熵（IRH），以及Conn＆x005f;索引（iconn＆x005f;索引）。本文还提供了对这些ICVIS的行为的正确，下划分的彻底的比较研究，分区分离（PS）指数以及四个最近引入的ICVIS：增量Xie-Beni（IXB），增量戴维斯 - BOULDEN（IDB）和增量通用DUNN＆X2019; S索引43和53（IGD43和IGD53）。使用框架进行实验，该框架被设计为尽可能不可知的聚类算法。合成基准数据集的结果表明，虽然可以从大多数划分情况下推断出大多数划分的案件的证据，但发现过度分区是一个更具挑战性的问题，而不是较少的问题。有趣的是，在本研究中的真实数据实验中，更突出地检测到过度分区，而不是划分的划分。 ICVIS的扩展为实时评估和解释无监督终身学习的结果提供了重要的新机遇，其中由于内存和/或应用约束，不能再处理样品。

著录项

来源
《Quality Control, Transactions》 |2020年第2020期|22025-22047|共23页
作者
Brito Da Silva Leonardo Enzo; Melton Niklas Max; Wunsch Donald C.;
展开▼
作者单位

Missouri Univ Sci & Technol Appl Computat Intelligence Lab Rolla MO 65409 USA|Minist Educ Brazil CAPES Fdn BR-70040020 Brasilia DF Brazil;

Missouri Univ Sci & Technol Appl Computat Intelligence Lab Rolla MO 65409 USA;

Missouri Univ Sci & Technol Appl Computat Intelligence Lab Rolla MO 65409 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Clustering; validation; incremental cluster validity index (iCVI); adaptive resonance theory (ART); incremental (online) clustering algorithms; data streams;

机译：群集;验证;增量簇有效性指数（ICVI）;自适应共振理论（艺术）;增量（在线）聚类算法;数据流;

相似文献

外文文献
中文文献
专利

1. Performance Evaluation of the Data Clustering Techniques and Cluster Validity Indices for Efficient Toolpath Development for Incremental Sheet Forming [J] . Aniket Nagargoje, Pavan K. Kankar, Prashant K. Jain, Journal of Computing and Information Science in Engineering . 2021,第3期

机译：用于增量板成形的高效刀具路径开发的数据聚类技术和群集有效指标的性能评估
2. Comparative Study of Clustering Methods over Ill- Structured Datasets using Validity Indices [J] . Sheik Faritha Begum, K. P. Kaliyamurthie, A. Rajesh Indian Journal of Science and Technology . 2016,第12期

机译：使用有效性指标对结构不良数据集的聚类方法进行比较研究
3. An extensive comparative study of cluster validity indices [J] . Arbelaitz O., Gurrutxaga I., Muguerza J., Pattern Recognition: The Journal of the Pattern Recognition Society . 2013,第1期

机译：聚类有效性指标的广泛比较研究
4. A Study on the Relationship between Internal and External Validity Indices Applied to Partitioning and Density-based Clustering Algorithms [C] . Caroline Tomasini, Eduardo N. Borges, Karina Machado, International Conference on Enterprise Information Systems . 2017

机译：基于密度的聚类算法的内部和外部有效性指标关系的研究
5. Examining the performance of population-based incremental learning and island model population-based incremental learning on a GA-hard problem with a very large search space. [D] . Brownlee, Benjamin Richard. 2010

机译：检查具有很大搜索空间的GA难题的基于人口的增量学习和基于岛模型的基于人口的增量学习的性能。
6. Brain Tissue Classification Based on Diffusion Tensor Imaging: A Comparative Study Between Some Clustering Algorithms and Their Effect on Different Diffusion Tensor Imaging Scalar Indices [O] . Ihab Elaff 2016

机译：基于扩散张量成像的脑组织分类：一些聚类算法及其对不同扩散张量成像标量指标影响的比较研究
7. Incremental Cluster Validity Indices for Online Learning of Hard Partitions: Extensions and Comparative Study [O] . Leonardo Enzo Brito Da Silva, Niklas Max Melton, Donald C. Wunsch 2020

机译：用于在线学习的增量群体有效性指数：扩展和比较研究

Incremental Cluster Validity Indices for Online Learning of Hard Partitions: Extensions and Comparative Study

摘要

著录项

相似文献

相关主题

期刊订阅