【24h】

A Bounded Index for Cluster Validity

机译:聚类有效性的有界索引

获取原文
获取原文并翻译 | 示例

摘要

Clustering is one of the most well known types of unsuper vised learning. Evaluating the quality of results and determining the number of clusters in data is an important issue. Most current validity indices only cover a subset of important aspects of clusters. Moreover, these indices are relevant only for data sets containing at least two clusters. In this paper, a new bounded index for cluster validity, called the score function (SF), is introduced. The score function is based on standard cluster properties. Several artificial and real-life data sets are used to evaluate the performance of the score function. The score function is tested against four existing validity indices. The index proposed in this paper is found to be always as good or better than these indices in the case of hyperspheroidal clusters. It is shown to work well on multidimensional data sets and is able to accommodate unique and sub-cluster cases.
机译:聚类是无监督学习的最著名类型之一。评估结果的质量并确定数据中的簇数是一个重要的问题。当前大多数有效性指标仅涵盖集群重要方面的子集。此外,这些索引仅与包含至少两个聚类的数据集相关。本文介绍了一种新的聚类有效性有界索引,称为得分函数(SF)。得分函数基于标准群集属性。几个人造的和真实的数据集用于评估得分函数的性能。针对四个现有有效性指标测试了得分函数。发现在超球状星团的情况下,本文提出的指标总是比这些指标好或好。它显示可以在多维数据集上很好地工作,并且能够适应独特和子集群的情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号