首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >Improved fast partitional clustering algorithm for text clustering
【24h】

Improved fast partitional clustering algorithm for text clustering

机译:改进的文本群集快速分区聚类算法

获取原文
获取原文并翻译 | 示例
           

摘要

Document clustering has become an important task for processing the big amount of textual information available on the Internet. On the other hand, k-means is the most widely used algorithm for clustering, mainly due to its simplicity and effectiveness. However, k-means becomes slow for large and high dimensional datasets, such as document collections. Recently the FPAC algorithm was proposed to mitigate this problem, but the improvement in the speed was reached at the cost of reducing the quality of the clustering results. For this reason, in this paper, we introduce an improved FPAC algorithm, which, according our experiments on different document collections, allows obtaining better clustering results than FPAC, without highly increasing the runtime.
机译:文档群集已成为处理互联网上可用的大量文本信息的重要任务。 另一方面,K-means是最广泛使用的聚类算法,主要是由于其简单性和有效性。 然而,K-Meanse对于大型和高维数据集(例如文档集)而变慢。 最近,提出了FPAC算法来减轻这个问题,但速度的提高以降低聚类结果的质量的成本达到。 因此,在本文中,我们介绍了一种改进的FPAC算法,根据我们对不同文档集合的实验,允许获得比FPAC更好的聚类结果,而无需高度增加运行时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号