【24h】

Finding Generalized Path Patterns for Web Log Data Mining

机译:查找Web日志数据挖掘的通用路径模式

获取原文
获取原文并翻译 | 示例

摘要

Conducting data mining on logs of web servers involves the determination of frequently occurring access sequences. We examine the problem of finding traversal patterns from web logs by considering the fact that irrelevant accesses to web documents may be interleaved within access patterns due to navigational purposes. We define a general type of pattern that takes into account this fact and also, we present a level-wise algorithm for the determination of these patterns, which is based on the underlying structure of the web site. The performance of the algorithm and its sensitivity to several parameters is examined experimentally with synthetic data.
机译:在Web服务器的日志上进行数据挖掘涉及确定频繁发生的访问顺序。通过考虑以下事实,我们研究了从Web日志中查找遍历模式的问题:由于导航目的,对Web文档的无关访问可能会在访问模式内交错。我们定义了一种考虑到这一事实的通用模式类型,并且,我们基于网站的基础结构,提出了一种确定这些模式的逐级算法。使用合成数据对算法的性能及其对几个参数的敏感性进行了实验检验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号