
Estimating The Web Robot Population


Abstract

In this research, capture-recapture (CR) models are used to estimate the population of web robots from the web server access logs of different websites. Each robot is treated as an individual randomly surfing the web, and each website is treated as a trap that records robot visits. We use a maximum likelihood estimator to fit the observed data. Results show that there are 3,860 identifiable robot User-Agent strings and 780,760 IP addresses being used by web robots around the world. We also examine the origin of the named robots by their IP addresses. The results suggest that over 50% of web robot IP addresses are from the United States and China.
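
To illustrate the capture-recapture idea described above, the following is a minimal sketch of the simplest two-trap case, in which two websites each record the set of robot identifiers they saw. It uses the classical Lincoln-Petersen estimator and Chapman's bias-corrected variant. This is not the model fitted in the paper, which the abstract describes only as a maximum likelihood fit over multiple websites. The function names and sample identifiers are hypothetical.

```python
def lincoln_petersen(seen_a, seen_b):
    """Two-sample Lincoln-Petersen estimate of total population size.

    seen_a, seen_b: sets of robot identifiers (e.g. User-Agent strings)
    recorded by two websites acting as "traps". Illustrative only.
    """
    n1 = len(seen_a)           # robots captured at site A
    n2 = len(seen_b)           # robots captured at site B
    m = len(seen_a & seen_b)   # robots captured at both sites
    if m == 0:
        raise ValueError("no overlap between samples; estimate is undefined")
    return n1 * n2 / m


def chapman(seen_a, seen_b):
    """Chapman's bias-corrected variant; defined even with zero overlap."""
    n1 = len(seen_a)
    n2 = len(seen_b)
    m = len(seen_a & seen_b)
    return (n1 + 1) * (n2 + 1) / (m + 1) - 1


# Hypothetical toy data: identifiers seen in two sites' access logs.
site_a = {"bot1", "bot2", "bot3", "bot4"}
site_b = {"bot3", "bot4", "bot5", "bot6", "bot7", "bot8"}

print(lincoln_petersen(site_a, site_b))  # 4 * 6 / 2 = 12.0
print(chapman(site_a, site_b))           # 5 * 7 / 3 - 1
```

The intuition is that if site B recaptures a small share of the robots already seen at site A, the unseen population is probably large. Both estimators assume a closed population and equal capture probability for every robot, which real crawler traffic may violate.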
