首页> 外文会议>Advances in Knowledge Discovery and Data Mining >Optimal Algorithms for Finding User Access Sessions from Very Large Web Logs
【24h】

Optimal Algorithms for Finding User Access Sessions from Very Large Web Logs

机译:从超大型Web日志中查找用户访问会话的最佳算法

获取原文

摘要

Although efficient identification of user access sessions from very large web logs is an unavoidable data preparation task for the success of higher level web log mining, little attention has been paid to algorithmic study of this problem. In this paper we consider two types of user access sessions, interval sessions and gap sessions. We design two efficient algorithms for finding respectively those two types of sessions with the help of new data structures. We present both theoretical and empirical analysis of the algorithms and prove that both algorithms have optimal time complexity.
机译:尽管从非常大的Web日志中有效识别用户访问会话是成功进行更高级别Web日志挖掘的不可避免的数据准备任务,但对这一问题的算法研究却很少关注。在本文中,我们考虑两种类型的用户访问会话:间隔会话和间隔会话。我们设计了两种有效的算法,借助新的数据结构分别查找这两种类型的会话。我们介绍了算法的理论和经验分析,并证明这两种算法都具有最佳的时间复杂度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号