首页> 外文会议>International Conference on Fuzzy Systems and Knowledge Discovery >An Improved Partitioning-based Web Documents Clustering Method Combining GA with ISODATA
【24h】

An Improved Partitioning-based Web Documents Clustering Method Combining GA with ISODATA

机译:一种改进的基于段的Web文档聚类方法,其与ISODATA相结合

获取原文

摘要

The existing partitioning-based clustering algorithms, such as k-means, k-medoids and their variations, are simple in theory and fast in convergence speed, but they always just reach local optimum when the iterations terminate and they are not suitable for discovering clusters in the cases when their sizes are very different. This paper proposes an improved Web documents clustering method, using genetic algorithm (GA) which introduces some ideas of ISODATA[6] into the design of its mutation operation. Experiments show that the GA's global search characteristic can avoid local optimum and the ISODATA-based mutation operation makes the improved clustering algorithm have the self-adjusting ability to discover clusters of different sizes.
机译:现有的基于分区的聚类算法,例如k-means,k-myoids及其变化,理论上简单,并且在收敛速度中快速快,但是当迭代终止时,它们总是达到当地的最佳状态,并且它们不适合发现集群在他们的尺寸非常不同的情况下。本文提出了一种改进的Web文档聚类方法,使用遗传算法(GA),它引入了ISODATA [6]的一些思想,进入其突变操作的设计。实验表明,GA的全球搜索特性可以避免局部最佳,基于ISOData的突变操作使得改进的聚类算法具有发现不同尺寸簇的自调节能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号