首页> 外文会议>International Conference on Tools with Artificial Intelligence >Detecting Impolite Crawler by Using Time Series Analysis
【24h】

Detecting Impolite Crawler by Using Time Series Analysis

机译:使用时间序列分析检测不礼貌的履带

获取原文

摘要

Numerous web crawlers especially impolite crawlers visit websites to get contents every day, which yields higher access frequency than the websites can hold. The big traffic of impolite crawlers causes a strong hazard on analysis of normal users and advertisement income. In this paper, we present a method to detect impolite crawlers by using time series analysis. This method is applied to real data of web server logs. Compared with the old methods only using common log attributes as features, the method using time series features improves detection accuracy by at least 20%
机译:许多网络爬虫,尤其是不礼貌的爬虫每天都会访问网站来获取内容,这带来了比网站所能容纳的访问频率更高的访问频率。不礼貌的爬虫流量大,对正常用户和广告收入的分析造成很大的危害。在本文中,我们提出了一种使用时间序列分析来检测不礼貌爬虫的方法。此方法适用于Web服务器日志的真实数据。与仅使用通用日志属性作为特征的旧方法相比,使用时间序列特征的方法将检测精度提高了至少20%

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号