首页> 外文会议>Exploiting the Knowledge Base: Applications of Rule Based Control >webSPADE: a parallel sequence mining algorithm to analyze web log data
【24h】

webSPADE: a parallel sequence mining algorithm to analyze web log data

机译:webSPADE:一种并行序列挖掘算法,用于分析Web日志数据

获取原文
获取原文并翻译 | 示例

摘要

Enterprise-class web sites receive a large amount of traffic, from both registered and anonymous users. Data warehouses are built to store and help analyze the click streams within this traffic to provide companies with valuable insights into the behavior of their customers. This article proposes a parallel sequence mining algorithm, webSPADE, to analyze the click streams found in site web logs. In this process, raw web logs are first cleaned and inserted into a data warehouse. The click streams are then mined by webSPADE. An innovative web-based front-end is used to visualize and query the sequence mining results. The webSPADE algorithm is currently used by Verizon to analyze the daily traffic of the Verizon.com web site.
机译:企业级网站从注册用户和匿名用户那里都收到大量流量。数据仓库的建立是为了存储和帮助分析流量中的点击流,从而为公司提供有关其客户行为的宝贵见解。本文提出了一种并行序列挖掘算法webSPADE,用于分析在站点Web日志中找到的点击流。在此过程中,首先清理原始Web日志并将其插入到数据仓库中。然后,通过webSPADE挖掘点击流。创新的基于Web的前端用于可视化和查询序列挖掘结果。 Verizon当前使用webSPADE算法来分析Verizon.com网站的每日流量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号