首页> 外文期刊>International Journal of Advanced Computer Research >Web Usage Data Clustering Using Improved Genetic Fuzzy C-Means Algorithm
【24h】

Web Usage Data Clustering Using Improved Genetic Fuzzy C-Means Algorithm

机译:改进的遗传模糊C-均值算法的Web使用数据聚类

获取原文
           

摘要

Web usage mining involves application of data mining techniques to discover usage patterns from the web data. Clustering is one of the important functions in web usage mining. Recent attempts have adapted the C-means clustering algorithm as well as genetic algorithms to find sets of clusters .In this paper; we have proposed a new framework to improve the web sessions' cluster quality from fuzzy c-means clustering using Improved Genetic Algorithm (GA). Initially a fuzzy c-means algorithm is used to cluster the user sessions. The refined initial starting condition allows the iterative algorithm to converge to a "better" local minimum. And in the second step, we have proposed a new GA based refinement algorithm to improve the cluster quality. The proposed algorithm is tested with web access logs collected from the UCI dataset repository
机译:Web使用情况挖掘涉及数据挖掘技术的应用,以从Web数据中发现使用模式。群集是Web使用挖掘中的重要功能之一。最近的尝试已经改编了C均值聚类算法和遗传算法以找到聚类集。我们提出了一个新的框架,以使用改进的遗传算法(GA)从模糊c均值聚类中改善Web会话的聚类质量。最初,使用模糊c均值算法对用户会话进行聚类。改进的初始启动条件允许迭代算法收敛到“更好”的局部最小值。第二步,我们提出了一种新的基于遗传算法的细化算法来提高聚类质量。使用从UCI数据集存储库收集的Web访问日志对提出的算法进行了测试

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号