Canonical PSO Based K-Means Clustering Approach for Real Datasets

Abstract

Clustering is a technique whose significance and applications span many fields. Because clustering is an unsupervised process in data mining, properly evaluating the results and measuring the compactness and separability of the clusters are important issues. The procedure for evaluating the results of a clustering algorithm is known as cluster validity measurement. Different indices suit different problems, and the choice of index depends on the kind of data available. This paper first proposes a Canonical PSO based K-means clustering algorithm, analyses some important clustering indices (intercluster, intracluster), and then evaluates the effects of those indices on a real-time air pollution database and on wholesale customer, wine, and vehicle datasets using typical K-means, Canonical PSO based K-means, simple PSO based K-means, DBSCAN, and hierarchical clustering algorithms. The paper also describes the nature of the clusters and compares the performance of these clustering algorithms according to the validity assessment. It identifies which of these algorithms is most desirable for producing compact clusters on these particular real-life datasets. Finally, it examines the behaviour of the clustering algorithms with respect to the validation indices and presents the evaluation results in mathematical and graphical form.
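
The abstract names the ingredients but not the formulas, so the following is only a minimal sketch of how a canonical (constriction-factor) PSO might drive K-means-style centroid search, together with simple intracluster and intercluster measures. Everything here is an assumption for illustration: the function names, the choice of mean squared point-to-centroid distance as the fitness, the intercluster definition (smallest squared centroid distance), and the parameter values c1 = c2 = 2.05 are not taken from the paper.

```python
import math
import random

def constriction(c1=2.05, c2=2.05):
    """Clerc-Kennedy constriction coefficient; requires c1 + c2 > 4."""
    phi = c1 + c2
    return 2.0 / abs(2.0 - phi - math.sqrt(phi * phi - 4.0 * phi))

def dist(a, b):
    """Euclidean distance between two points given as sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def assign(points, centroids):
    """Index of the nearest centroid for every point."""
    return [min(range(len(centroids)), key=lambda j: dist(p, centroids[j]))
            for p in points]

def intra_cluster(points, centroids):
    """Assumed compactness measure: mean squared distance to nearest centroid."""
    return sum(min(dist(p, c) for c in centroids) ** 2 for p in points) / len(points)

def inter_cluster(centroids):
    """Assumed separation measure: smallest squared centroid distance (needs k >= 2)."""
    return min(dist(a, b) ** 2
               for i, a in enumerate(centroids) for b in centroids[i + 1:])

def pso_kmeans(points, k, n_particles=10, iters=50, c1=2.05, c2=2.05, seed=0):
    """Hypothetical canonical-PSO centroid search; fitness = intra_cluster."""
    rng = random.Random(seed)
    chi = constriction(c1, c2)
    dim = len(points[0])
    # Each particle encodes k candidate centroids, started from random data points.
    pos = [[list(p) for p in rng.sample(points, k)] for _ in range(n_particles)]
    vel = [[[0.0] * dim for _ in range(k)] for _ in range(n_particles)]
    pbest = [[c[:] for c in particle] for particle in pos]
    pbest_fit = [intra_cluster(points, particle) for particle in pos]
    g = min(range(n_particles), key=pbest_fit.__getitem__)
    gbest, gbest_fit = [c[:] for c in pbest[g]], pbest_fit[g]
    for _ in range(iters):
        for i in range(n_particles):
            for j in range(k):
                for d in range(dim):
                    r1, r2 = rng.random(), rng.random()
                    # Constricted velocity update, then position update.
                    vel[i][j][d] = chi * (
                        vel[i][j][d]
                        + c1 * r1 * (pbest[i][j][d] - pos[i][j][d])
                        + c2 * r2 * (gbest[j][d] - pos[i][j][d]))
                    pos[i][j][d] += vel[i][j][d]
            fit = intra_cluster(points, pos[i])
            if fit < pbest_fit[i]:
                pbest[i], pbest_fit[i] = [c[:] for c in pos[i]], fit
                if fit < gbest_fit:
                    gbest, gbest_fit = [c[:] for c in pos[i]], fit
    return gbest, gbest_fit

if __name__ == "__main__":
    # Synthetic two-blob data, purely for demonstration.
    rng = random.Random(1)
    data = [[rng.gauss(cx, 0.3), rng.gauss(cy, 0.3)]
            for cx, cy in [(0, 0), (5, 5)] for _ in range(20)]
    centroids, fitness = pso_kmeans(data, k=2)
    print("centroids:", centroids)
    print("intracluster:", fitness, "intercluster:", inter_cluster(centroids))
```

Lower intracluster values indicate more compact clusters and higher intercluster values indicate better separation, which is the kind of comparison the abstract describes across the different algorithms. A real evaluation would substitute the paper's own index definitions and datasets.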

Bibliographic details

Journal: International Scholarly Research Notices
Authors: Lopamudra Dey; Sanjay Chakraborty
Year (volume), issue: 2014 (2014), -1
Year: 2014
Pages: 414013
Total pages: 11
Original format: PDF

Similar literature

1. Canonical PSO Based K-Means Clustering Approach for Real Datasets [J]. Lopamudra Dey, Sanjay Chakraborty. International Scholarly Research Notices. 2014, No. 1

2. A hybrid reciprocal model of PCA and K-means with an innovative approach of considering sub-datasets for the improvement of K-means initialization and step-by-step labeling to create clusters with high interpretability [J]. Anaraki Seyed Alireza Mousavian, Haeri Abdorrahman, Moslehi Fateme. Pattern Analysis and Applications. 2021, No. 3

3. A fast and effective partitional clustering algorithm for large categorical datasets using a k-means based approach [J]. Ben Salem Semeh, Naouali Sami, Chtourou Zied. Computers and Electrical Engineering. 2018

4. K-Means clustering GAN based Fault Diagnosis Approach for Imbalanced Dataset [C]. Huifang Li, Rui Fan, Qisong Shi. International Symposium on Computational Intelligence and Industrial Applications; Beijing Association of Automation; Beijing Institute of Technology.

5. Clustered Disclosure upon Scheduled Macro News Announcements: A Real-Option Based Approach [D]. Hu, Xiaoli. 2018

6. Classification of Two Class Motor Imagery Tasks Using Hybrid GA-PSO Based K-Means Clustering [O]. Suraj, Purnendu Tiwari, Subhojit Ghosh. 2015

7. Canonical PSO Based k-Means Clustering Approach for Real Datasets [O]. Dey, Lopamudra; Chakraborty, Sanjay. 2014
