首页> 外文期刊>Computer and Information Science >The Comparison of SOM and K-means for Text Clustering
【24h】

The Comparison of SOM and K-means for Text Clustering

机译:文本聚类的SOM和K-means的比较

获取原文
       

摘要

SOM and k-means are two classical methods for text clustering. In this paper some experiments have been done to compare their performances. The sample data used is 420 articles which come from different topics. K-means method is simple and easy to implement; the structure of SOM is relatively complex, but the clustering results are more visual and easy to comprehend. The comparison results also show that k-means is sensitive to initiative distribution, whereas the overall clustering performance of SOM is better than that of k-means, and it also performs well for detection of noisy documents and topology preservation, thus make it more suitable for some applications such as navigation of document collection, multi-document summarization and etc. whereas the clustering results of SOM is sensitive to output layer topology.
机译:SOM和k-means是用于文本聚类的两种经典方法。在本文中,已经进行了一些实验以比较它们的性能。所使用的样本数据是420条来自不同主题的文章。 K-means方法简单易实现; SOM的结构相对复杂,但聚类结果更直观且易于理解。比较结果还表明,k均值对主动分布敏感,而SOM的总体聚类性能优于k均值,并且在噪声文档检测和拓扑保存方面也表现良好,因此使其更适合对于某些应用程序,例如文档集合的导航,多文档摘要等,而SOM的聚类结果对输出层拓扑敏感。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号