Journal: 《北京理工大学学报:英文版》 (Journal of Beijing Institute of Technology, English Edition)

Similarity matrix-based K-means algorithm for text clustering

         

Abstract

The K-means algorithm is one of the most widely used algorithms in clustering analysis. To address the problem caused by the random selection of initial center points in the traditional algorithm, this paper proposes an improved K-means algorithm based on the similarity matrix. The improved algorithm avoids random selection of the initial center points, so it supplies effective initial points for the clustering process and reduces the fluctuation in clustering results caused by initial point selection, yielding better clustering quality. The experimental results also show that the F-measure of the improved K-means algorithm is greatly improved and that the clustering results are more stable.
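
The abstract does not spell out how the similarity matrix is used to choose the initial centers. The following is only an illustrative sketch of the general idea, not the paper's procedure. It assumes documents are already represented as term-weight vectors, uses cosine similarity, and applies a hypothetical seeding heuristic: start from the point with the highest total similarity, then repeatedly add the point that is least similar to the seeds chosen so far. All function names are invented for this example, and the refinement step is plain Euclidean K-means.

```python
import numpy as np

def cosine_similarity_matrix(X):
    """Pairwise cosine similarity between the rows of X."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    unit = X / np.where(norms == 0, 1.0, norms)  # guard against zero vectors
    return unit @ unit.T

def select_initial_centers(S, k):
    """Deterministically pick k seed indices from similarity matrix S.

    Assumed heuristic (not taken from the paper): begin with the point that
    has the highest total similarity, then repeatedly add the point whose
    largest similarity to the current seeds is smallest.
    """
    seeds = [int(np.argmax(S.sum(axis=1)))]
    while len(seeds) < k:
        closest = S[:, seeds].max(axis=1)
        closest[seeds] = np.inf  # never re-pick an existing seed
        seeds.append(int(np.argmin(closest)))
    return seeds

def kmeans_with_similarity_seeds(X, k, max_iter=100):
    """Standard Lloyd iterations started from similarity-based seeds."""
    X = np.asarray(X, dtype=float)
    S = cosine_similarity_matrix(X)
    centers = X[select_initial_centers(S, k)].copy()
    labels = np.full(len(X), -1)
    for _ in range(max_iter):
        # Euclidean distance from every point to every center.
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        new_labels = dists.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break  # assignments stopped changing
        labels = new_labels
        for j in range(k):
            members = X[labels == j]
            if len(members):  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return labels, centers
```

Because the seeds are a deterministic function of the similarity matrix, repeated runs on the same data return the same partition, which is the kind of stability the abstract attributes to avoiding random initialization.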
