The Comparison of SOM and K-means for Text Clustering

Yiheng Chen; Bing Qin; Ting Liu; Yuanchao Liu; Sheng Li

首页> 外文期刊>Computer and Information Science >The Comparison of SOM and K-means for Text Clustering

【24h】

The Comparison of SOM and K-means for Text Clustering

机译：文本聚类的SOM和K-means的比较

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

SOM and k-means are two classical methods for text clustering. In this paper some experiments have been done to compare their performances. The sample data used is 420 articles which come from different topics. K-means method is simple and easy to implement; the structure of SOM is relatively complex, but the clustering results are more visual and easy to comprehend. The comparison results also show that k-means is sensitive to initiative distribution, whereas the overall clustering performance of SOM is better than that of k-means, and it also performs well for detection of noisy documents and topology preservation, thus make it more suitable for some applications such as navigation of document collection, multi-document summarization and etc. whereas the clustering results of SOM is sensitive to output layer topology.

机译：SOM和k-means是用于文本聚类的两种经典方法。在本文中，已经进行了一些实验以比较它们的性能。所使用的样本数据是420条来自不同主题的文章。 K-means方法简单易实现; SOM的结构相对复杂，但聚类结果更直观且易于理解。比较结果还表明，k均值对主动分布敏感，而SOM的总体聚类性能优于k均值，并且在噪声文档检测和拓扑保存方面也表现良好，因此使其更适合对于某些应用程序，例如文档集合的导航，多文档摘要等，而SOM的聚类结果对输出层拓扑敏感。

著录项

来源
《Computer and Information Science》 |2010年第2期|共4页
作者
Yiheng Chen; Bing Qin; Ting Liu; Yuanchao Liu; Sheng Li;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. The Comparison of SOM and K-means for Text Clustering [J] . Yiheng Chen, Bing Qin, Ting Liu, Computer and Information Science . 2010,第2期

机译：文本聚类的SOM和K-means的比较
2. Comparison of Distributed K-Means and Distributed Fuzzy C-Means Algorithms for Text Clustering [J] . I Made Artha Agastya, Teguh Bharata Adji, Noor Akhmad Setiawan Communications in Science and Technology . 2017,第1期

机译：文本聚类的分布式K均值和分布式模糊C均值算法的比较
3. DIC-DOC-K-means: Dissimilarity-based Initial Centroid selection for DOCument clustering using K-means for improving the effectiveness of text document clustering [J] . Lakshmi R., Baskar S. Journal of Information Science . 2019,第6期

机译：DIC-DOC-K-means：使用K-means的DOCument聚类基于不相似性的初始质心选择，以提高文本文档聚类的效率
4. Integrating SOM and Fuzzy K-means Clustering for Customer Classification in Personalized Recommendation System for Non-Text based Transactional Data [C] . Sukhpreet Dhaliwal, Ngoc Nhu Van, Manmeet Dhaliwal, International Conference on Information Technology . 2017

机译：集成SOM和模糊K-MEARICELING在个性化基于事务数据的个性化推荐系统中的客户分类
5. Evaluation of Text Document Clustering Using k-Means [D] . Beumer, Lisa. 2020

机译：使用K-Means的文本文档聚类评估
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm Minimum Spanning Tree and Hierarchical Clustering in an Applied Study [O] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, 2020

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法最小生成树和分层聚类的三种混合方法的比较
7. The Comparison of SOM and K-means for Text Clustering [O] . Yiheng Chen (corresponding, Bing Qin, Ting Liu, 2015

机译：文本聚类的sOm和K均值比较

The Comparison of SOM and K-means for Text Clustering

摘要

著录项

相似文献

相关主题

期刊订阅