首页> 外文会议>International Conference on Measurement and Control Engineering >PREPROCESSING OF WEB SERVER LOGS FROM ONLINE NEWSPAPER
【24h】

PREPROCESSING OF WEB SERVER LOGS FROM ONLINE NEWSPAPER

机译:从在线报纸的预处理Web服务器日志

获取原文

摘要

This paper proposes a new method to perform preprocessing in web usage mining. The data used for this experiment is web server logs from an online newspaper in Malaysia. The preprocessing stage consists of data cleaning and user identification. In this project, Python 2.6 is used as the main language to perform the data cleaning operations. Detailed explanation on data cleaning is illustrated, as well as the steps taken to conduct user identification. The results of data cleaning and user identification based on our experiment are also discussed. The output of this study is a log file which has been cleaned, and can be used in the next stage of web usage mining; which is pattern discovery.
机译:本文提出了一种在Web使用挖掘中进行预处理的新方法。用于此实验的数据是来自马来西亚在线报纸的Web服务器日志。预处理阶段包括数据清理和用户识别。在该项目中,Python 2.6用作执行数据清洁操作的主要语言。说明了关于数据清洁的详细说明,以及所采取的步骤进行执行用户识别。还讨论了基于我们的实验的数据清理和用户识别的结果。本研究的输出是已清除的日志文件,可以在Web使用挖掘的下一个阶段使用;这是模式发现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号