International Conference on Big Data Analysis

Design and Implementation of a Scalable Distributed Web Crawler Based on Hadoop



Abstract

In this article, an efficient and scalable distributed web crawler system based on Hadoop is designed and implemented. The paper first briefly introduces the application of cloud computing to web crawling, then presents the detailed design of a highly scalable crawler system that exploits Hadoop's distributed processing and cloud computing features, and finally reports performance statistics for the system. Under identical conditions, compared with an existing mature system, the distributed web crawler shows a clear advantage. This advantage is particularly important for crawling massive data in the big data era.
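As a rough illustration of the kind of distributed design the abstract describes, the sketch below shows one common way a Hadoop-style crawler partitions its URL frontier: each URL is assigned to one of N crawler nodes by hashing its hostname, so all pages of a site land on the same node (keeping per-host politeness logic local to one worker). This is a minimal sketch under assumed conventions, not the authors' actual design; the function names are illustrative.

```python
# Minimal sketch (assumption, not the paper's implementation): partition a
# URL frontier across crawler nodes by hashing the hostname, analogous to
# the shuffle/partition step of a MapReduce job.
import hashlib
from urllib.parse import urlparse
from collections import defaultdict

def assign_node(url: str, num_nodes: int) -> int:
    """Map a URL to a crawler-node index by hashing its hostname."""
    host = urlparse(url).netloc.lower()
    digest = hashlib.md5(host.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_nodes

def partition_frontier(urls, num_nodes):
    """Group a URL frontier into per-node work lists."""
    buckets = defaultdict(list)
    for url in urls:
        buckets[assign_node(url, num_nodes)].append(url)
    return dict(buckets)

frontier = [
    "http://example.com/a",
    "http://example.com/b",
    "http://example.org/x",
]
parts = partition_frontier(frontier, 4)
# All URLs from the same host always land in the same bucket.
```

Hashing by hostname rather than by full URL is a deliberate choice: it concentrates each site's crawl on one node so politeness delays and robots.txt caching need no cross-node coordination.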

