Source: Proceedings of the international conference on information systems design and intelligent applications 2012

Design and Implementation of an Effective Web Server Log Preprocessing System

Abstract

The WWW constitutes a huge repository and a distributed, dynamically growing hypermedium that supports access to information and services. As more organizations rely on the WWW to conduct business, analyzing user behavior in web-based applications is becoming difficult. Information about users' interactions with a website is stored in server logs and serves as a huge electronic survey of the website. Web usage mining deals with discovering usage patterns from server logs in order to understand and better serve the needs of web users. The raw information contained in a log file is noisy. Preprocessing includes cleaning, user identification, sessionization, path completion, and structurization, and is a prerequisite for improving the accuracy and efficiency of the subsequent mining process. This paper focuses on an effective web log preprocessing system. Experimental results show that the proposed system reduces the log file down to 12% of its original size and improves the performance of preprocessing in identifying users, sessions, path completion, and structurization.
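The first three preprocessing stages named in the abstract (cleaning, user identification, sessionization) can be illustrated with a minimal Python sketch. The abstract does not describe the paper's actual algorithms, so everything here is an assumption: the Combined Log Format input, the list of ignored file suffixes, the use of the (IP address, user agent) pair as a user key, and the 30-minute inactivity timeout are common heuristics rather than the authors' method. The names `parse_line`, `is_relevant`, `sessionize`, and `preprocess` are hypothetical. Path completion and structurization are not covered.

```python
import re
from collections import defaultdict
from datetime import datetime, timedelta

# Assumed input: Apache Combined Log Format (the abstract does not name a format).
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+) [^"]*" (?P<status>\d{3}) \S+'
    r'(?: "(?P<referrer>[^"]*)" "(?P<agent>[^"]*)")?'
)

# Assumed cleaning rules: embedded resources are treated as noise.
IGNORED_SUFFIXES = (".gif", ".jpg", ".jpeg", ".png", ".css", ".js", ".ico")
# Assumed session boundary: 30 minutes of inactivity.
SESSION_TIMEOUT = timedelta(minutes=30)


def parse_line(line):
    """Parse one log line into a dict, or return None if it does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    rec = match.groupdict()
    rec["time"] = datetime.strptime(rec["time"], "%d/%b/%Y:%H:%M:%S %z")
    rec["status"] = int(rec["status"])
    rec["agent"] = rec["agent"] or ""  # agent is absent in Common Log Format
    return rec


def is_relevant(rec):
    """Cleaning: keep successful GET page requests from non-robot clients."""
    url = rec["url"].split("?", 1)[0].lower()
    if url.endswith(IGNORED_SUFFIXES) or url == "/robots.txt":
        return False
    if rec["method"] != "GET" or not 200 <= rec["status"] < 300:
        return False
    return "bot" not in rec["agent"].lower()


def sessionize(records):
    """Group records by (ip, agent) and split each group on the timeout."""
    by_user = defaultdict(list)
    for rec in sorted(records, key=lambda r: r["time"]):
        by_user[(rec["ip"], rec["agent"])].append(rec)

    sessions = []
    for user, recs in by_user.items():
        current = [recs[0]]
        for prev, rec in zip(recs, recs[1:]):
            if rec["time"] - prev["time"] > SESSION_TIMEOUT:
                sessions.append((user, current))
                current = []
            current.append(rec)
        sessions.append((user, current))
    return sessions


def preprocess(lines):
    """Run parsing, cleaning, and sessionization over raw log lines."""
    parsed = (parse_line(line) for line in lines)
    cleaned = [rec for rec in parsed if rec is not None and is_relevant(rec)]
    return sessionize(cleaned)
```

Calling `preprocess` on an iterable of raw log lines returns a list of `(user_key, records)` pairs, one per session. Comparing the number of cleaned records with the number of input lines gives a rough measure of how much the cleaning step shrinks the log.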
