首页> 外文会议>International Conference on Intelligent Human-Machine Systems and Cybernetics;IHMSC >A Web Crawler Detection Algorithm Based on Web Page Member List
【24h】

A Web Crawler Detection Algorithm Based on Web Page Member List

机译:基于网页成员列表的网页爬虫检测算法

获取原文

摘要

Following the widely use of search engines, the impact Web crawlers have on the Web sites should not be ignored. After analyzing the navigational patterns of Web crawlers from Web logs, a new algorithm based on Web page member list is proposed. The algorithm constructs one member list for every Web page and one show table for every visitor. The experiment shows that the new algorithm can detect the unknown crawlers and unfriendly crawlers who do not obey the Standard for Robot Exclusion.
机译:随着搜索引擎的广泛使用,网络爬虫对网站的影响不容忽视。通过从Web日志中分析Web爬虫的导航模式,提出了一种基于Web成员列表的新算法。该算法为每个网页构造一个成员列表,为每个访问者构造一个展示表。实验表明,新算法可以检测出不遵守机器人排除标准的未知爬虫和不友好的爬虫。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号