Journal of Information Science

Web robot detection based on pattern-matching technique

Abstract

In web robot detection it is important to find features that are common characteristics of diverse robots, in order to differentiate between them and humans. Existing approaches employ fairly simple features (e.g. empty referrer field, interval between successive requests), which often fail to reflect web robots' behaviour accurately. False alarms may therefore occur unacceptably often. In this paper we propose a fresh approach that expresses the behaviour of interactive users and various web robots in terms of a sequence of request types, called request patterns. Previous proposals have primarily targeted the detection of text crawlers, but our approach works well on many other web robots, such as image crawlers, email collectors and link checkers. A decision tree algorithm proposed by Tan and Kumar was also applied to the same data. A comparison shows that the proposed approach is more accurate, and that real-time detection of web robots is feasible.
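The abstract does not specify the request types or the patterns themselves, so the following is only a minimal sketch of the general idea of encoding a session as a sequence of request types and matching it against patterns. The type alphabet (`H` for page, `I` for image, `O` for other), the extension sets, the pattern names and the regular expressions are all hypothetical choices made for illustration, not the paper's definitions.

```python
import re
from urllib.parse import urlparse

# Hypothetical request-type alphabet (an assumption, not taken from the paper):
#   H = page request, I = image request, O = any other resource
IMAGE_EXT = {".png", ".jpg", ".jpeg", ".gif", ".ico"}
PAGE_EXT = {"", ".html", ".htm", ".php"}


def request_type(url: str) -> str:
    """Map a requested URL to a one-letter request type by file extension."""
    path = urlparse(url).path.lower()
    name = path.rsplit("/", 1)[-1]
    ext = "." + name.rsplit(".", 1)[-1] if "." in name else ""
    if ext in PAGE_EXT:
        return "H"
    if ext in IMAGE_EXT:
        return "I"
    return "O"


def encode_session(urls) -> str:
    """Express a session as a string of request types (its request pattern)."""
    return "".join(request_type(u) for u in urls)


# Hypothetical patterns, invented for this sketch only.
PATTERNS = {
    # Requests pages only and never fetches embedded resources.
    "text_crawler": re.compile(r"H{4,}"),
    # Requests images only.
    "image_crawler": re.compile(r"I{4,}"),
}


def classify(urls) -> str:
    """Return the first pattern label matching the whole session, else 'unknown'."""
    seq = encode_session(urls)
    for label, pattern in PATTERNS.items():
        if pattern.fullmatch(seq):
            return label
    return "unknown"
```

For instance, a session that requests only `/a.html`, `/b.html`, `/c.html` and `/d.html` is encoded as `HHHH` and matches the illustrative `text_crawler` pattern, whereas a session that mixes pages, images and stylesheets matches neither pattern and is left as `unknown`.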
