An Improved K-means Text Clustering Algorithm by Optimizing Initial Cluster Centers



Abstract

K-means is an influential clustering algorithm in data mining. The traditional K-means algorithm is sensitive to the initial cluster centers, so the clustering result depends excessively on how those centers are chosen. To overcome this shortcoming, this paper proposes an improved K-means text clustering algorithm that optimizes the initial cluster centers. The algorithm first calculates the density of each data object in the data set and then judges which data objects are isolated points. After all isolated points are removed, a set of high-density data objects is obtained. The algorithm then chooses, as the initial cluster centers, the k high-density data objects that are farthest apart from one another. Experimental results show that the improved K-means algorithm improves the stability and accuracy of text clustering.
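
The abstract does not define density, the isolated-point test, or the exact rule for picking far-apart centers, so the following is only a minimal sketch of the described steps under assumptions of our own, not the authors' implementation. The function name `initial_centers` and the parameters `radius` and `isolated_ratio` are hypothetical: density is taken as the number of neighbors within `radius`, a point is treated as isolated when its density falls below `isolated_ratio` times the mean density, and centers are picked greedily by farthest-first selection over Euclidean distance.

```python
import math


def initial_centers(points, k, radius, isolated_ratio=0.5):
    """Pick k initial centers: dense, non-isolated points that lie far apart.

    All thresholds here are illustrative assumptions, not taken from the paper.
    """
    n = len(points)
    # Pairwise Euclidean distances (text vectors often use cosine distance instead).
    dist = [[math.dist(p, q) for q in points] for p in points]

    # Assumed density: number of other points within `radius`.
    density = [
        sum(1 for j in range(n) if j != i and dist[i][j] <= radius)
        for i in range(n)
    ]

    # Assumed isolated-point rule: density below a fraction of the mean density.
    mean_density = sum(density) / n
    kept = [i for i in range(n) if density[i] >= isolated_ratio * mean_density]

    # Assumed high-density rule: at least the mean density of the kept points.
    kept_mean = sum(density[i] for i in kept) / len(kept)
    dense = [i for i in kept if density[i] >= kept_mean]
    if len(dense) < k:
        raise ValueError("fewer than k high-density points; adjust radius or k")

    # Start from the densest point, then repeatedly add the dense point
    # whose nearest already-chosen center is farthest away.
    chosen = [max(dense, key=lambda i: density[i])]
    while len(chosen) < k:
        remaining = [i for i in dense if i not in chosen]
        chosen.append(
            max(remaining, key=lambda i: min(dist[i][c] for c in chosen))
        )
    return [points[i] for i in chosen]
```

The returned points could then seed any standard K-means loop in place of random initialization. On real text data, `radius` would need tuning to the chosen document representation and distance measure.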


Bibliographic details

Source: International Conference on Cloud Computing and Big Data, 2016, pp. 265-268 (4 pages)
Authors: Caiquan Xiong; Zhen Hua; Ke Lv; Xuan Li
Original format: PDF
Keywords: Clustering algorithms; Algorithm design and analysis; Partitioning algorithms; Heuristic algorithms; Computer science; Optimization methods


Similar literature


1. An Improved K-Means Algorithm Based on Initial Clustering Center Optimization [J]. LI Taihao, NAREN Tuya, ZHOU Jianshe. 中兴通讯技术（英文版） [ZTE Communications Technology (English edition)], 2017, Issue 0z2.
2. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm, Minimum Spanning Tree, and Hierarchical Clustering in an Applied Study [J]. Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi. Computational and Mathematical Methods in Medicine, 2020, Issue 1.
3. A Genetic Algorithm for Optimized Initial Centers K-Means Clustering in SMEs [J]. Bain Khusul Khotimah, Firli Irhamni, Tri Sundarwati. Journal of Theoretical and Applied Information Technology, 2016, Issue 1.
4. An Improved K-means Text Clustering Algorithm by Optimizing Initial Cluster Centers [C]. Caiquan Xiong, Zhen Hua, Ke Lv. International Conference on Cloud Computing and Big Data, 2016.
5. Hardware Implementation and Performance Evaluation of K-Means and K-Means++ Clustering Algorithms [D]. Singh, Manisha. 2019.
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm, Minimum Spanning Tree, and Hierarchical Clustering in an Applied Study [O]. Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi. 2020.
7. Improved K-means Algorithm Based on Optimizing Initial Cluster Centers and Its Application [O]. Xue Linyao, Wang Jianguo. 2018.
