【24h】

Alternative Approach to Tree-Structured Web Log Representation and Mining

机译:树型Web日志表示和挖掘的替代方法

获取原文

摘要

More recent approaches to web log data representation aim to capture the user navigational patterns with respect to the overall structure of the web site. One such representation is tree-structured log files which is the focus of this work. Most existing methods for analyzing such data are based on the use of frequent sub tree mining techniques to extract frequent user activity and navigational paths. In this paper we evaluate the use of other standard data mining techniques enabled by a recently proposed structure preserving flat data representation for tree-structured data. The initially proposed framework was adjusted to better suit the web log mining task. Experimental evaluation is performed on two real world web log datasets and comparisons are made with an existing state-of-the-art classifier for tree-structured data. The results show the great potential of the method in enabling the application of a wider range of data mining/analysis techniques to tree-structured web log data.
机译:Web日志数据表示的最新方法旨在捕获有关网站总体结构的用户导航模式。一种这样的表示形式是树状结构的日志文件,这是这项工作的重点。用于分析此类数据的大多数现有方法都是基于使用频繁的子树挖掘技术来提取频繁的用户活动和导航路径。在本文中,我们评估了最近提出的保留用于树状结构数据的平面数据表示的结构所支持的其他标准数据挖掘技术的使用。最初提出的框架已进行了调整,以更好地适应Web日志挖掘任务。在两个真实世界的Web日志数据集上进行了实验评估,并与现有的针对树状结构数据的最新分类器进行了比较。结果表明,该方法在将更广泛的数据挖掘/分析技术应用到树状结构Web日志数据中具有巨大的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号