AN IMPROVED AND EFFICIENT HYBRIDIZED K-MEANS CLUSTERING ALGORITHM FOR HIGHDIMENSIONAL DATASET & IT?S PERFORMANCE ANALYSIS

Prof H.S Behera; Rosly Boy Lingdoh; Diptendra Kodamasingh

首页> 外文期刊>International Journal on Computer Science and Engineering >AN IMPROVED AND EFFICIENT HYBRIDIZED K-MEANS CLUSTERING ALGORITHM FOR HIGHDIMENSIONAL DATASET & IT?S PERFORMANCE ANALYSIS

【24h】

AN IMPROVED AND EFFICIENT HYBRIDIZED K-MEANS CLUSTERING ALGORITHM FOR HIGHDIMENSIONAL DATASET & IT?S PERFORMANCE ANALYSIS

机译：高维数据集的改进高效混合K均值聚类算法及其性能分析

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In practical life we can see the rapid growth in the various data objects around us, which thereby demands the increase of features and attributes of the data set. This phenomenon, in turn leads to the increase of dimensions of the various data sets. When increase of dimension occurred, the ultimate problem referred to as the ?the curse of dimensionality? comes in to picture. For this reason, in order to mine a high dimensional data set an improved and an efficient dimension reduction technique is very crucial and apparently can be considered as the need of the hour. Numerous methods have been proposed and many experimental analyses have been done to find out an efficient reduction technique so as to reduce the dimension of a high dimensional data set without affecting the original data?s. In this paper we proposed the use of Canonical Variate analysis, which serves the purpose of reducing the dimensions of a high dimensional dataset in a more efficient and effective manner. Then to the reduced low dimensional data set, a clustering technique is applied using a modified k-means clustering. In our paper for the purpose of initializing the initial centroids of the Improved Hybridized K Means clustering algorithm (IHKMCA) we make use of genetic algorithm, so as to get a more accurate result. The results thus found from the proposed work have better accuracy, more efficient and less time complexity as compared to other approaches.

机译：在实际生活中，我们可以看到周围各种数据对象的快速增长，因此需要增加数据集的特征和属性。这种现象继而导致各种数据集的尺寸增加。当尺寸增加时，最终的问题称为“尺寸诅咒”。进入图片。因此，为了挖掘高维数据集，一种改进的高效降维技术非常关键，显然可以认为是小时的需要。已经提出了许多方法，并且已经进行了许多实验分析以找到有效的归约技术，以便在不影响原始数据的情况下减小高维数据集的维数。在本文中，我们建议使用规范变量分析，其目的在于以一种更有效的方式减少高维数据集的维数。然后，对经过简化的低维数据集，使用改进的k均值聚类应用聚类技术。在本文中，为了初始化改进的混合K均值聚类算法（IHKMCA）的初始质心，我们利用遗传算法，以获得更准确的结果。因此，与其他方法相比，从建议的工作中发现的结果具有更好的准确性，更有效的方法和更少的时间复杂性。

著录项

来源
《International Journal on Computer Science and Engineering》 |2011年第3期|共8页
作者
Prof H.S Behera; Rosly Boy Lingdoh; Diptendra Kodamasingh;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Genetic Algorithm Based Dimensionality Reduction for Improving Performance of K-Means Clustering: A Case Study for Categorization of Medical Dataset [J] . Asha Gowda Karegowda, Vidya T. Shama, M.A. Jayaram, International journal of soft computing . 2012,第5期

机译：基于遗传算法的降维方法提高K-Means聚类性能：以医学数据集分类为例
2. Genetic Algorithm Based Dimensionality Reduction for Improving Performance of K-Means Clustering: A Case Study for Categorization of Medical Dataset [J] . Asha Gowda Karegowda, Vidya T. Shama, M.A. Jayaram, International journal of soft computing . 2012,第5期

机译：基于遗传算法的降维方法提高K-Means聚类性能：以医学数据集分类为例
3. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm, Minimum Spanning Tree, and Hierarchical Clustering in an Applied Study [J] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, Computational and mathematical methods in medicine . 2020,第1期

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法，最小生成树和分层聚类的三种混合方法的比较
4. An Improved Clustering Algorithm Based on k-Means and Artificial Bee Colony Optimization for Datasets that Contain Outliers [C] . Anu Balachandran, K.A. Abdul Nazeer 2018 International Conference on Computing, Power and Communication Technologies . 2018

机译：包含离群值的数据集的基于k均值和人工蜂群优化的改进聚类算法
5. Hardware Implementation and Performance Evaluation of K-Means and K-Means++ Clustering Algorithms [D] . Singh, Manisha . 2019

机译：K-Means和K-Means ++聚类算法的硬件实现和性能评估
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm Minimum Spanning Tree and Hierarchical Clustering in an Applied Study [O] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, 2020

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法最小生成树和分层聚类的三种混合方法的比较
7. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm, Minimum Spanning Tree, and Hierarchical Clustering in an Applied Study [O] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, 2020

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法，最小生成树和分层聚类的三种混合方法的比较
8. Sampling Within k-Means Algorithm to Cluster Large Datasets [R] . Bejarano, J., Bose, K., Brannan, T., 2011

机译：在k-means算法中采样以聚类大数据集

AN IMPROVED AND EFFICIENT HYBRIDIZED K-MEANS CLUSTERING ALGORITHM FOR HIGHDIMENSIONAL DATASET & IT?S PERFORMANCE ANALYSIS

摘要

著录项

相似文献

相关主题

期刊订阅