【24h】

Web usage mining based on probabilistic latent semantic analysis

机译:基于概率潜在语义分析的Web用法挖掘

获取原文
获取外文期刊封面目录资料

摘要

The primary goal of Web usage mining is the discovery of patterns in the navigational behavior of Web users. Standard approaches, such as clustering of user sessions and discovering association rules or frequent navigational paths, do not generally provide the ability to automatically characterize or quantify the unobservable factors that lead to common navigational patterns. It is, therefore, necessary to develop techniques that can automatically discover hidden semantic relationships among users as well as between users and Web objects. Probabilistic Latent Semantic Analysis (PLSA) is particularly useful in this context, since it can uncover latent semantic associations among users and pages based on the co-occurrence patterns of these pages in user sessions. In this paper, we develop a unified framework for the discovery and analysis of Web navigational patterns based on PLSA. We show the flexibility of this framework in characterizing various relationships among users and Web objects. Since these relationships are measured in terms of probabilities, we are able to use probabilistic inference to perform a variety of analysis tasks such as user segmentation, page classification, as well as predictive tasks such as collaborative recommendations. We demonstrate the effectiveness of our approach through experiments performed on real-world data sets.
机译:Web使用挖掘的主要目标是发现Web用户导航行为中的模式。标准方法(例如,用户会话的聚类和发现关联规则或频繁的导航路径)通​​常不提供自动表征或量化导致常见导航模式的不可观察因素的能力。因此,有必要开发一种可以自动发现用户之间以及用户与Web对象之间的隐藏语义关系的技术。概率潜在语义分析(PLSA)在这种情况下特别有用,因为它可以根据用户会话中这些页面的共现模式来发现用户与页面之间的潜在语义关联。在本文中,我们为基于PLSA的Web导航模式的发现和分析开发了一个统一的框架。在表征用户和Web对象之间的各种关系时,我们展示了此框架的灵活性。由于这些关系是根据概率来衡量的,因此我们能够使用概率推断来执行各种分析任务,例如用户细分,页面分类以及预测性任务(例如协作推荐)。我们通过在现实世界的数据集上进行的实验来证明我们的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号