首页> 外文期刊>The international arab journal of information technology >A Novel Mobile Crawler System Based on Filtering off Non-Modified Pages for Reducing Load on the Network
【24h】

A Novel Mobile Crawler System Based on Filtering off Non-Modified Pages for Reducing Load on the Network

机译:基于过滤掉未修改页面以减少网络负载的新型移动爬虫系统

获取原文
获取原文并翻译 | 示例
           

摘要

The studies in the literature show that about 40% of the current Internet traffic and bandwidth consumption is due to web crawlers that retrieve pages for indexing by the different search engines. This traffic and bandwidth consumption will increase in future due to the exponential growth of the web. This paper addresses the problem of bandwidth consumption by introducing an efficient indexing system based on mobile crawlers. The proposed system employs mobile agents to crawl the pages. These mobile agent based crawlers retrieve the pages, process them, compare their data to filter out pages that are not modified after last crawl, and then compress them before sending them to the search engine for indexing. The experimental results of the proposed system are very encouraging.
机译:文献中的研究表明,当前Internet流量和带宽消耗中约40%是由于Web爬网程序检索页面以供不同搜索引擎编制索引所致。由于网络的指数增长,这种流量和带宽消耗在将来会增加。本文通过引入一种基于移动爬网程序的高效索引系统来解决带宽消耗的问题。提出的系统使用移动代理来爬网页面。这些基于移动代理的爬网程序将检索页面,对其进行处理,比较它们的数据以过滤掉上次爬网后未修改的页面,然后对其进行压缩,然后再将其发送到搜索引擎进行索引。该系统的实验结果非常令人鼓舞。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号