首页> 外文期刊>Computer Science & Information Technology >Improvement of a Method Based on Hidden Markov Model for Clustering Web Users
【24h】

Improvement of a Method Based on Hidden Markov Model for Clustering Web Users

机译:基于隐马尔可夫模型的Web用户聚类方法的改进

获取原文
           

摘要

Nowadays the determination of the dynamics of sequential data, such as marketing, finance,social sciences or web research has receives much attention from researchers and scholars.Clustering of such data by nature is always a more challenging task. This paper investigates theapplications of different Markov models in web mining and improves a developed method forclustering web users, using hidden Markov models. In the first step, the categorical sequencesare transformed into a probabilistic space by hidden Markov model. Then, in the second step,hierarchical clustering, the performance of clustering process is evaluated with variousdistances criteria. Furthermore this paper shows implementation of the proposed improvementswith symmetric distance measure as Total-Variance and Mahalanobis compared with theprevious use of the proposed method (such as Kullback–Leibler) on the well-known Microsoftdataset with website user search patterns is more clearly result in separate clusters.
机译:如今,诸如营销,金融,社会科学或网络研究之类的顺序数据动力学的确定已引起研究人员和学者的广泛关注。本质上对此类数据进行聚类始终是一项更具挑战性的任务。本文研究了不同的马尔可夫模型在Web挖掘中的应用,并改进了使用隐马尔可夫模型对网络用户进行聚类的方法。第一步,通过隐马尔可夫模型将分类序列转换为概率空间。然后,在第二步,层次聚类中,使用各种距离标准评估聚类过程的性能。此外,本文还表明,与之前在网站用户搜索模式下使用著名的Microsoft数据集(如Kullback-Leibler)使用对称距离测度(如Total-Variance和Mahalanobis)相比,拟议的改进措施的实现更明显,这是单独产生的集群。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号