首页> 外文期刊>International Journal of Engineering and Technology >Pre Processing of Web Logs ? An Improved Approach For E-Commerce Websites
【24h】

Pre Processing of Web Logs ? An Improved Approach For E-Commerce Websites

机译:Web日志的预处理?一种改进的电子商务网站方法

获取原文
           

摘要

In this paper an improved approach for pre processing of web logs data has been proposed and evaluated so that it can be applied for web logs of e-commerce web sites. The resultant web log data after these pre processing steps can be used for further pattern discovery and analysis that helps to provide useful prediction to enhance e-commerce. Ideally, the input for the Web Usage Mining process is a user session file that gives an exact account of who accessed the web site, what pages were requested and in what order, and how long each page was viewed. A user session is the set of the page accesses that occur during a single visit to a web site by a web user. However, the information contained in a raw web server log does not reliably represent a user session file before data pre processing. Hence, data pre processing plays an important role in web usage mining applications. The data preparation process is often the most time consuming and computationally intensive step in the web usage mining process. The scope of this work is to enhance existing pre processing techniques for user and session identification that makes the web log data ready to use. This research work proposes a time-oriented and web ontology based user session identification algorithm which is found to be effective than the existing pre-processing approaches considering the run time, memory usage and processing complexity factors.
机译:在本文中,已经提出并评估了一种改进的预处理Web日志数据的方法,以便可以将其应用于电子商务网站的Web日志。经过这些预处理步骤后得到的Web日志数据可用于进一步的模式发现和分析,有助于提供有用的预测以增强电子商务。理想情况下,Web用法挖掘过程的输入是一个用户会话文件,该文件可以准确说明谁访问了该网站,请求了哪些页面,按什么顺序以及每个页面被查看了多长时间。用户会话是Web用户单次访问网站期间发生的一组页面访问。但是,原始Web服务器日志中包含的信息不能可靠地表示数据预处理之前的用户会话文件。因此,数据预处理在Web使用挖掘应用程序中起着重要作用。数据准备过程通常是Web使用情况挖掘过程中最耗时且计算最密集的步骤。这项工作的范围是增强用于用户和会话标识的现有预处理技术,从而使Web日志数据可以立即使用。这项研究工作提出了一种基于时间和Web本体的面向用户的会话识别算法,该算法被认为比考虑运行时间,内存使用和处理复杂性因素的现有预处理方法更为有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号