首页> 外文会议>Information Retrieval Technology >A Full Distributed Web Crawler Based on StructuredNetwork
【24h】

A Full Distributed Web Crawler Based on StructuredNetwork

机译:基于结构化网络的全分布式Web爬虫

获取原文

摘要

Distributed Web crawlers have recently received more and more attention from researchers. Full decentralized crawler without a centralized managing server seems to be an interesting architectural paradigm for realizing large scale information collecting systems for its scalability, failure resilience and increased autonomy of nodes. This paper provides a novel full distributed Web crawler system which is based on structured network, and a distributed crawling model is developed and applied in it which improves the performance of the system. Some important issues such as assignment of tasks, solution of scalability have been discussed. Finally, an experimental study is used to verify the advantages of system, and the results are comparatively satisfying.
机译:分布式Web搜寻器最近受到了研究人员的越来越多的关注。没有集中式管理服务器的完全分散式爬网程序似乎是一种有趣的体系结构范例,可用于实现大规模信息收集系统,因为它具有可伸缩性,故障弹性和增加的节点自治性。本文提供了一种基于结构化网络的新型全分布式Web爬虫系统,并开发了分布式爬虫模型并在其中进行了应用,以提高系统的性能。讨论了一些重要问题,例如任务分配,可伸缩性解决方案。最后,通过实验研究验证了该系统的优越性,结果令人满意。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号