The Design and Implementation of a High-Efficiency Distributed Web Crawler

机译：高效分布式网络爬虫的设计与实现

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

With the rapid development of the Internet, the amount of data on the Internet become more and more huge, and the website technology is constantly changing. Faced with the huge and complex data on the global Internet, how to crawl and use this information has become a major challenge. Traditional stand-alone web crawler is difficult to cope with the challenges brought by the rapid growth of information, and it is difficult to grab huge amounts of data quickly and effectively. In this paper, we research to use the distributed technology to design and implement an efficient, configurable, load balancing and scalable distributed web crawler system.

机译：随着Internet的飞速发展，Internet上的数据量越来越大，网站技术也在不断变化。面对全球互联网上庞大而复杂的数据，如何抓取和使用这些信息已成为一项重大挑战。传统的独立Web爬网程序难以应对信息快速增长所带来的挑战，并且难以快速有效地获取大量数据。在本文中，我们研究使用分布式技术来设计和实现高效，可配置，负载平衡和可扩展的分布式Web爬网程序系统。

著录项

来源
《2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress》|2016年|100-104|共5页
会议地点 Auckland(NZ)
作者
Qiumei Pu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Uniform resource locators; Crawlers; Web pages; Distributed databases; Information filters;

机译：统一资源定位器；搜寻器；网页；分布式数据库；信息过滤器;

相似文献

外文文献
中文文献
专利

1. Object Architected Design and Efficient Dynamic Adjustment Mechanism of Distributed Web Crawlers [J] . Cheng-Hung Tsai, Tsun Ku, Wu-Fan Chien International journal of interdisciplinary telecommunications and networking . 2015,第1期

机译：分布式Web爬虫的对象体系结构设计和有效的动态调整机制
2. Design and Implementation of Distributed Facebook Crawler Based on Interaction Simulation [J] . B.S Satpute, Raj Ambani, RohitRai, International Journal of Engineering Trends and Technology . 2014,第2期

机译：基于交互仿真的分布式Facebook爬虫的设计与实现
3. Application of Distributed Web Crawlers in Information Management System | Wen | Informatica [J] . Bo Wen Informatica: An International Journal of Computing and Informatics . 2018,第1期

机译：分布式Web爬虫在信息管理系统中的应用。温|信息学
4. The Design and Implementation of a High-Efficiency Distributed Web Crawler [C] . Qiumei Pu International conference on Cyber Science and Technology Congress . 2016

机译：高效分布式Web履带的设计与实现
5. Design and implementation of an intelligent Web crawler for corporate data scraping. [D] . Qin, Xinfeng. 2007

机译：用于企业数据抓取的智能Web搜寻器的设计和实现。
6. Design implementation and evaluation of a national campaign to distribute nine million free LLINs to children under five years of age in Tanzania [O] . Kimberly Bonner, Alex Mwita, Peter D McElroy, 2011

机译：设计实施和评估一项全国运动向坦桑尼亚的五岁以下儿童分发900万免费LLIN
7. Design and Implementation of Scalable, Fully Distributed Web Crawler for a Web Search Engine [O] . M. Sunil Kumar 2011

机译：Web搜索引擎的可扩展，完全分布式Web爬网程序的设计和实现
8. Distributed design tools: Mapping targeted design tools onto a Web-based distributed architecture for high-performance computing [R] . Holmes, V. P. , Linebarger, J. M. , Miller, D. J. , 1999

机译：分布式设计工具：将目标设计工具映射到基于Web的分布式架构，以实现高性能计算

The Design and Implementation of a High-Efficiency Distributed Web Crawler

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅