Distributed Web crawlers have recently attracted growing attention from researchers. A fully decentralized crawler without a central managing server is an appealing architectural paradigm for large-scale information-collecting systems, owing to its scalability, failure resilience, and the increased autonomy of its nodes. This paper presents a novel fully distributed Web crawler built on a structured network, together with a distributed crawling model that improves the system's performance. Important issues such as task assignment and scalability are discussed. Finally, an experimental study verifies the advantages of the system, and the results are satisfactory.
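As a rough illustration of task assignment in a structured (DHT-style) overlay, one common approach is to hash each URL's hostname onto a consistent-hashing ring of crawler nodes, so that every node owns a disjoint slice of the Web and all pages of one host land on the same node. This is a hedged sketch under that assumption, not the paper's actual algorithm; the node names, replica count, and hash choice are illustrative.

```python
import hashlib
from bisect import bisect_right
from urllib.parse import urlparse

class CrawlRing:
    """Sketch of hash-based crawl-task assignment on a structured overlay."""

    def __init__(self, nodes, replicas=64):
        # Place several virtual points per node on the ring for balance.
        self._ring = []  # sorted list of (point, node)
        for node in nodes:
            for i in range(replicas):
                self._ring.append((self._hash(f"{node}#{i}"), node))
        self._ring.sort()

    @staticmethod
    def _hash(key):
        return int(hashlib.sha1(key.encode()).hexdigest(), 16)

    def node_for(self, url):
        # Hash the hostname, not the full URL, so one host maps to one
        # node (helps politeness rules and duplicate detection).
        host = urlparse(url).hostname or url
        point = self._hash(host)
        idx = bisect_right(self._ring, (point, chr(0x10FFFF)))
        if idx == len(self._ring):
            idx = 0  # wrap around the ring
        return self._ring[idx][1]
```

For example, `CrawlRing(["node-a", "node-b", "node-c"]).node_for(...)` returns the same node for every URL on a given host, and adding or removing a node remaps only the keys adjacent to it on the ring, which matters for the scalability and failure-resilience properties the abstract highlights.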