首页> 美国卫生研究院文献>International Scholarly Research Notices >Canonical PSO Based K-Means Clustering Approach for Real Datasets
【2h】

Canonical PSO Based K-Means Clustering Approach for Real Datasets

机译:基于规范PSO的真实数据集K-Means聚类方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

“Clustering” the significance and application of this technique is spread over various fields. Clustering is an unsupervised process in data mining, that is why the proper evaluation of the results and measuring the compactness and separability of the clusters are important issues. The procedure of evaluating the results of a clustering algorithm is known as cluster validity measure. Different types of indexes are used to solve different types of problems and indices selection depends on the kind of available data. This paper first proposes Canonical PSO based K-means clustering algorithm and also analyses some important clustering indices (intercluster, intracluster) and then evaluates the effects of those indices on real-time air pollution database, wholesale customer, wine, and vehicle datasets using typical K-means, Canonical PSO based K-means, simple PSO based K-means, DBSCAN, and Hierarchical clustering algorithms. This paper also describes the nature of the clusters and finally compares the performances of these clustering algorithms according to the validity assessment. It also defines which algorithm will be more desirable among all these algorithms to make proper compact clusters on this particular real life datasets. It actually deals with the behaviour of these clustering algorithms with respect to validation indexes and represents their results of evaluation in terms of mathematical and graphical forms.
机译:“聚类”这项技术的重要性和应用遍及各个领域。聚类是数据挖掘中不受监督的过程,这就是为什么对结果进行正确评估以及衡量聚类的紧凑性和可分离性是重要问题的原因。评估聚类算法结果的过程称为聚类有效性度量。使用不同类型的索引来解决不同类型的问题,并且索引选择取决于可用数据的类型。本文首先提出了基于规范PSO的K-means聚类算法,并分析了一些重要的聚类指标(集群间,集群内),然后使用典型值评估了这些指标对实时空气污染数据库,批发客户,葡萄酒和车辆数据集的影响。 K-means,基于Canonical PSO的K-means,基于简单PSO的K-means,DBSCAN和分层聚类算法。本文还描述了聚类的性质,最后根据有效性评估比较了这些聚类算法的性能。它还定义了在所有这些算法中更希望使用哪种算法在此特定的现实生活数据集上进行适当的紧凑聚类。它实际上是针对验证指标来处理这些聚类算法的行为,并以数学和图形形式表示其评估结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号