首页> 外文会议>IEEE International Conference on Tools with Artificial Intelligence >Capturing user access patterns in the web for data mining
【24h】

Capturing user access patterns in the web for data mining

机译:捕获Web中的用户访问模式以获取数据挖掘

获取原文

摘要

Existing methods for knowledge discovery in the Web are mostly server-oriented and approaches taken are affected by the use of proxy servers. As a result, it is difficult to capture individual Web user behavior from the current log mechanism. Asan effort to remedy this problem, we develop in this paper methods for design and implementation of an access pattern collection server to conduct data mining in the Web. We also devise an innovative method, called page conversion, which converts theoriginal Web pages to enciphered ones so that the devised data collection mechanism will not be deliberately bypassed. With the concept of page conversion, the methods we proposed involves a mechanism of software downloading to resolve the difficultyimposed by proxy servers and to effectively capture the Web user behavior. Using the devised mechanism, traversal patterns are generated and compared to those produced by the ordinary Web servers to validate our results. It is shown that the traversalpatterns resulting from the devised system are not only more informative but also more accurate than those generated by ordinary Web servers, showing the importance and the usefulness of the mechanism devised.
机译:Web中的现有知识发现方法大多是面向服务的,采取的方法受代理服务器的使用影响。结果,难以从当前日志机制捕获单独的网络用户行为。 ASAN努力解决此问题,我们在本文中开发了用于在Web中进行数据挖掘的访问模式收集服务器的设计和实现。我们还规定了一种创新的方法,称为页面转换,将理想网页转换为加密的方法,以便不会故意绕过设计的数据收集机制。通过页面转换的概念,我们提出的方法涉及一种软件下载机制来解决代理服务器难以解析,并有效地捕获Web用户行为。使用设计的机制,生成遍历模式,并与普通Web服务器产生的模式进行比较以验证我们的结果。结果表明,从设计系统产生的traversalpatterns不仅提供更多的信息,但也比普通的Web服务器生成,显示出重要性和设计机构的实用性更加准确。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号