【24h】

Boosting-genetic clustering algorithm

机译:提升遗传聚类算法

获取原文

摘要

K-means is one of the most popular techniques for clustering problem. However, the quality of resulting clusters heavily depends on the selection of initial centroids and may converge to a local optimum rather than global optimum. Genetic algorithm (GA) has been proposed by many researchers to solve a global solution for clustering problem. Even though GA yields higher accuracy result, it is only practical for small datasets. Clustering large datasets with GA is extremely slow or even impossible. In this paper, we proposed a new clustering algorithm, called Boosting-Genetic Clustering Algorithm (BGCA). Inspired by boosting algorithms, the BGCA algorithm combines multiple clustering results on a small number of specially selected samples and iteratively improves the accuracy of the inconsistent regions of data points. Experimental evaluation shows that BGCA yields a higher accuracy than traditional k-means and is very efficient for clustering large datasets.
机译:K-means是聚类问题最受欢迎的技术之一。然而,由此产生的集群的质量大大取决于初始质心的选择,并且可以收敛到局部最佳而不是全球最佳。许多研究人员提出了遗传算法(GA),以解决集群问题的全球解决方案。尽管Ga产生更高的精度结果,但它只是小型数据集是实用的。具有GA的聚类大数据集非常慢甚至不可能。在本文中,我们提出了一种新的聚类算法,称为Boosting-Genetic聚类算法(BGCA)。通过升压算法的启发,BGCA算法将多个聚类结果组合在少量特殊选择的样本上,并且迭代地提高数据点不一致区域的准确性。实验评估表明,BGCA比传统的K-means产生更高的精度,并且对聚类大数据集非常有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号