A Novel Defense Mechanism against Web Crawler Intrusion.

Abstract

Web robots, also known as crawlers or spiders, are used by search engines, hackers, and spammers to gather information about web pages. Timely detection and prevention of unwanted crawlers increases the privacy and security of websites. This research proposes a novel method for identifying web crawlers in order to prevent unwanted crawlers from accessing websites. The method uses a five-factor identification process to detect unwanted crawlers. The study reports pretest and posttest results, along with a systematic evaluation of web pages protected by the proposed identification technique versus web pages without it. A repeated-measures experiment was performed on two groups, each containing ninety web pages. The logistic regression analysis of the treatment and control groups confirms the five-factor identification process as an effective mechanism for preventing unwanted web crawlers. The study concludes that the proposed five-identifier process is an effective technique, as demonstrated by the experimental outcome.
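The abstract does not give the five factors or the fitted model, so the following is only a minimal sketch of the kind of treatment-versus-control comparison it describes. It assumes a single binary predictor (page protected or not) and a binary outcome (crawler blocked or not), for which the maximum-likelihood logistic regression has a closed form. The function name and the outcome counts are hypothetical placeholders, not results from the thesis; only the group size of ninety pages comes from the abstract. The repeated-measures structure of the actual experiment is ignored here.

```python
import math


def logit(p: float) -> float:
    """Log-odds of a probability strictly between 0 and 1."""
    return math.log(p / (1.0 - p))


def fit_treatment_logit(blocked_control: int, n_control: int,
                        blocked_treatment: int, n_treatment: int):
    """Closed-form fit of logit P(blocked) = b0 + b1 * treated.

    With one binary predictor the maximum-likelihood estimates are
    b0 = logit(control rate) and b1 = logit(treatment rate) - b0,
    so exp(b1) is the odds ratio between the two groups.
    """
    for blocked, n in ((blocked_control, n_control),
                       (blocked_treatment, n_treatment)):
        if not 0 < blocked < n:
            # A rate of exactly 0 or 1 has no finite log-odds.
            raise ValueError("each group needs both blocked and unblocked cases")
    b0 = logit(blocked_control / n_control)
    b1 = logit(blocked_treatment / n_treatment) - b0
    return b0, b1, math.exp(b1)


# Hypothetical counts for illustration only: 90 pages per group (from the
# abstract), with made-up numbers of pages on which the crawler was blocked.
intercept, treatment_coef, odds_ratio = fit_treatment_logit(30, 90, 60, 90)
print(f"b0={intercept:.3f}  b1={treatment_coef:.3f}  odds ratio={odds_ratio:.2f}")
```

A positive `b1` (odds ratio above 1) would indicate that protected pages are more likely to block the crawler than unprotected ones; a full analysis would also need standard errors and a model that accounts for the pretest and posttest measurements.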

Bibliographic record

  • Author: Aghamohammadi, Alireza
  • Author affiliation: Eastern Michigan University
  • Degree-granting institution: Eastern Michigan University
  • Subject: Computer Science; Information Science; Information Technology
  • Degree: Ph.D.
  • Year: 2013
  • Pages: 113 p.
  • Total pages: 113
  • Original format: PDF
  • Language of text: English
