Web 日志挖掘是将数据挖掘技术应用到Web服务器的日志中,发现Web用户的行为模式.在介绍了典型的数据预处理技术的基础上,指出Frame页面降低了挖掘结果的兴趣性,并提出相应的解决方法-Frame页面过滤算法消除其影响.通过实验数据对该算法进行验证,说明Frame页面过滤算法可以显著地提高Web日志挖掘结果的兴趣性.%Web usage mining is the application of data mining to Web server logs in order to discover the behavior patterus of Web site visitors. After introduction of some typical Web log preprocessing techniques, it is pointed out that the frame pages in a Web site can reduce the interestingness of the result page groups. Then, a frame-filtering algorithm is proposed to solve this problem. Our experiments show that this algorithm can eflectively reveal new interesting page groups, which would not be found without frame filtering.
展开▼