A new separation measure for improving the effectiveness of validity indices

Yue SH; Wang JS; Wu T; Wang HX

首页> 外文期刊>Information Sciences: An International Journal >A new separation measure for improving the effectiveness of validity indices

【24h】

A new separation measure for improving the effectiveness of validity indices

机译：一种提高有效性指标有效性的新分离措施

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Many validity indices have been proposed for quantitatively assessing the performance of clustering algorithms. One limitation of existing indices is their lack of generalizability, due to their dependence on the specific algorithms and structures of the data space. To handle large-scale datasets with arbitrary structures, this research study proposes a new cluster separation measure for improving the effectiveness of existing validity indices. This is achieved by partitioning the original data space into a grid-based structure which allows the introduction of a new measurement for assessing the true data distribution between any two clusters instead of the distance between the two cluster prototypes. To validate the effectiveness of the proposed separation measure, we adopt two commonly used validity indices, the Davies-Bouldin's function (DB) and Tibshirani's Gap statistic (GS). These indices are denoted as R-DB-1 and R-GS-1 for clusters with sphere-shaped structures and R-DB-2 and R-GS-2 for irregular-shaped structures. This integration enables the indices to evaluate both partitional algorithms and hierarchical algorithms. Partitional algorithms including C-Means (CM), Fuzzy C-Means (FCM), and hierarchical algorithms, including DBSCAN and CLIQUE, are used to test the performance of the new indices. Two synthetic datasets with spherical structures and four synthetic datasets with irregular shapes are first compared. Five real datasets from the UCI machine learning repository are then used to further test the measure's performance. The experimental results provide evidence that the new indices outperform the original indices.

机译：已经提出了许多有效性指标来定量评估聚类算法的性能。现有索引的局限性是它们缺乏通用性，因为它们依赖于特定算法和数据空间结构。为了处理具有任意结构的大规模数据集，本研究提出了一种新的聚类分离措施，以提高现有有效性指标的有效性。这是通过将原始数据空间划分为基于网格的结构来实现的，该结构允许引入一种新的度量来评估任何两个集群之间的真实数据分布，而不是两个集群原型之间的距离。为了验证所提出的分离措施的有效性，我们采用了两个常用的有效性指标，戴维斯-布尔丁函数（DB）和提比希尔尼的差距统计量（GS）。对于具有球形结构的群集，这些索引分别表示为R-DB-1和R-GS-1，对于不规则形状的结构，这些索引分别表示为R-DB-2和R-GS-2。这种集成使索引能够评估分区算法和分层算法。包括C均值（CM），模糊C均值（FCM）在内的分区算法以及包括DBSCAN和CLIQUE在内的分层算法均用于测试新索引的性能。首先比较两个具有球形结构的合成数据集和四个具有不规则形状的合成数据集。然后，使用来自UCI机器学习存储库的五个真实数据集来进一步测试该度量的性能。实验结果提供了新指标优于原始指标的证据。

著录项

来源
《Information Sciences: An International Journal》 |2010年第5期|共17页
作者
Yue SH; Wang JS; Wu T; Wang HX;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动信息理论;
关键词
Validity index; Clustering; Separation measure;

机译：有效性指标;聚类;分离度;

相似文献

外文文献
中文文献
专利

1. A new separation measure for improving the effectiveness of validity indices [J] . Yue SH, Wang JS, Wu T, Information Sciences: An International Journal . 2010,第5期

机译：一种提高有效性指标有效性的新分离措施
2. Strengths and Weaknesses of Two Empathy Measures: A Comparison of the Measurement Precision, Construct Validity, and Incremental Validity of Two Multidimensional Indices [J] . Murphy Brett A., Costello Thomas H., Watts Ashley L., Assessment . 2020,第2期

机译：两种同理化措施的优势和弱点：两种多维指标的测量精度，构建有效性和增量有效性的比较
3. A Cluster-Validity Index Combining an Overlap Measure and a Separation Measure Based on Fuzzy-Aggregation Operators [J] . Le Capitaine H., Frelicot C. Fuzzy Systems, IEEE Transactions on . 2011,第3期

机译：基于模糊聚合算子的重叠测度与分离测度相结合的聚类有效性指数
4. Improving Cluster Method Quality by Validity Indices [C] . N. Hachani, H. Ounalli International Florida Artificial Intelligence Research Society Conference(FLAIRS 2007); 20070507-09; Key West,FL(US) . 2007

机译：通过有效性指标提高聚类方法的质量
5. Measuring and mapping indices of biodiversity conservation effectiveness [D] . Wright, Benjamin E. 2011

机译：衡量和绘制生物多样性保护有效性的指数
6. Simplified Asset Indices to Measure Wealth and Equity in Health Programs: A Reliability and Validity Analysis Using Survey Data From 16 Countries [O] . Nirali M Chakraborty, Kenzo Fry, Rasika Behl, 2016

机译：衡量卫生计划中的财富和公平的简化资产指标：使用来自16个国家的调查数据进行的可靠性和有效性分析
7. A Cluster-Validity Index Combining an Overlap Measure and a Separation Measure Based on Fuzzy-Aggregation Operators [O] . Le Capitaine, Hoel, Frélicot, Carl 2011

机译：基于模糊聚合算子的重叠测度与分离测度相结合的聚类有效性指数
8. Variability of Measures of Weapons Effectiveness. Volume 7. Effectiveness Indices in Stick Delivery of Weapons [R] . Sivazlian, B. D. 1985

机译：武器效能测量的可变性。第7卷。武器交付的有效性指标

A new separation measure for improving the effectiveness of validity indices

摘要

著录项

相似文献

相关主题

期刊订阅