首页> 外文期刊>International Journal on Computer Science and Engineering >AN IMPROVED AND EFFICIENT HYBRIDIZED K-MEANS CLUSTERING ALGORITHM FOR HIGHDIMENSIONAL DATASET & IT?S PERFORMANCE ANALYSIS
【24h】

AN IMPROVED AND EFFICIENT HYBRIDIZED K-MEANS CLUSTERING ALGORITHM FOR HIGHDIMENSIONAL DATASET & IT?S PERFORMANCE ANALYSIS

机译:高维数据集的改进高效混合K均值聚类算法及其性能分析

获取原文
           

摘要

In practical life we can see the rapid growth in the various data objects around us, which thereby demands the increase of features and attributes of the data set. This phenomenon, in turn leads to the increase of dimensions of the various data sets. When increase of dimension occurred, the ultimate problem referred to as the ?the curse of dimensionality? comes in to picture. For this reason, in order to mine a high dimensional data set an improved and an efficient dimension reduction technique is very crucial and apparently can be considered as the need of the hour. Numerous methods have been proposed and many experimental analyses have been done to find out an efficient reduction technique so as to reduce the dimension of a high dimensional data set without affecting the original data?s. In this paper we proposed the use of Canonical Variate analysis, which serves the purpose of reducing the dimensions of a high dimensional dataset in a more efficient and effective manner. Then to the reduced low dimensional data set, a clustering technique is applied using a modified k-means clustering. In our paper for the purpose of initializing the initial centroids of the Improved Hybridized K Means clustering algorithm (IHKMCA) we make use of genetic algorithm, so as to get a more accurate result. The results thus found from the proposed work have better accuracy, more efficient and less time complexity as compared to other approaches.
机译:在实际生活中,我们可以看到周围各种数据对象的快速增长,因此需要增加数据集的特征和属性。这种现象继而导致各种数据集的尺寸增加。当尺寸增加时,最终的问题称为“尺寸诅咒”。进入图片。因此,为了挖掘高维数据集,一种改进的高效降维技术非常关键,显然可以认为是小时的需要。已经提出了许多方法,并且已经进行了许多实验分析以找到有效的归约技术,以便在不影响原始数据的情况下减小高维数据集的维数。在本文中,我们建议使用规范变量分析,其目的在于以一种更有效的方式减少高维数据集的维数。然后,对经过简化的低维数据集,使用改进的k均值聚类应用聚类技术。在本文中,为了初始化改进的混合K均值聚类算法(IHKMCA)的初始质心,我们利用遗传算法,以获得更准确的结果。因此,与其他方法相比,从建议的工作中发现的结果具有更好的准确性,更有效的方法和更少的时间复杂性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号