Design and implementation of web crawler based on dynamic web collection cycle


Abstract

The volume of web information is growing rapidly with advances in wireless networks and the emergence of diverse smart devices such as the iPhone and iPad. Information is continuously produced and updated anywhere and at any time through easy-to-use web platforms and social networks. How often frequently updated web data should be refreshed has therefore become a pressing issue in data integration and retrieval. In this paper, we propose dynamic web-data crawling methods, which include sensitive checking of website changes and dynamic retrieval of web pages from target websites. We also implemented a Java-based web crawling application and compared the performance of conventional static approaches with our proposed dynamic ones. Our experimental results showed a 59% performance benefit over the static crawling method.
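
The abstract does not spell out how the collection cycle is adjusted, so the following is only a minimal sketch of one common way to make a revisit interval dynamic, not the authors' algorithm. It assumes a change is detected by comparing SHA-256 digests of the fetched content, and that the interval is halved after a change and doubled otherwise. The class name `DynamicCycleSketch`, the `PageState` type, and the bounds `MIN_INTERVAL` and `MAX_INTERVAL` are all hypothetical.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Arrays;

/** Illustrative only: an adaptive revisit interval, not the paper's method. */
public class DynamicCycleSketch {
    // Hypothetical bounds on the revisit interval, in minutes.
    static final long MIN_INTERVAL = 15;
    static final long MAX_INTERVAL = 1440;

    /** Per-page crawl state: digest of the last fetch and the current interval. */
    static class PageState {
        byte[] lastDigest;      // null until the first fetch
        long intervalMinutes;

        PageState(long initialIntervalMinutes) {
            this.intervalMinutes = initialIntervalMinutes;
        }

        /** Records a fetch and returns true if the content changed since the previous one. */
        boolean update(String content) {
            byte[] digest = sha256(content);
            boolean first = (lastDigest == null);
            boolean changed = !first && !Arrays.equals(lastDigest, digest);
            lastDigest = digest;
            if (first) {
                return false;   // no baseline yet, so the interval stays as it is
            }
            if (changed) {
                // Page is active: revisit sooner (assumed policy: halve).
                intervalMinutes = Math.max(MIN_INTERVAL, intervalMinutes / 2);
            } else {
                // Page is quiet: back off (assumed policy: double).
                intervalMinutes = Math.min(MAX_INTERVAL, intervalMinutes * 2);
            }
            return changed;
        }
    }

    static byte[] sha256(String s) {
        try {
            return MessageDigest.getInstance("SHA-256")
                    .digest(s.getBytes(StandardCharsets.UTF_8));
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 is required on every Java platform, so this is unexpected.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        PageState page = new PageState(60);
        // Stand-ins for successive fetches of the same URL.
        String[] snapshots = {"v1", "v1", "v2", "v3", "v3"};
        for (String snapshot : snapshots) {
            boolean changed = page.update(snapshot);
            System.out.println("changed=" + changed
                    + " next interval=" + page.intervalMinutes + " min");
        }
    }
}
```

A static crawler would correspond to leaving `intervalMinutes` fixed; the sketch only shows how a per-page interval could follow observed change frequency.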


Bibliographic record

Source: The International Conference on Information Networking 2012, 2012, pp. 562-566 (5 pages)
Conference location: Bali (ID)
Authors: Kim K. S.; Kim K. Y.; Lee K. H.; Kim T. K.; Cho W. S.
Author affiliation: CoreEngineering, 643-6 Gack-ri, Ohchang, South Korea
Original format: PDF
Language of text: English
Chinese Library Classification: Computer networks

Similar literature

1. Clothing Information Collection Based on Theme Web Crawler [J]. Tang Zhi-hang, Li Jun, Zhou Yu-ying. International Journal of Advanced Networking and Applications. 2019, No. 4
2. The cooperation model for multi-agents and the identification on replicated collections for web crawler [J]. Kai Gao, Shengwang Li. International Journal of Modelling, Identification and Control. 2010, No. 3a4
3. How We Funneled Searchers from Google to Our Collections by Catering to Web Crawlers [J]. Marshall Breeding. Computers in Libraries. 2006, No. 4
4. Design and implementation of web crawler based on dynamic web collection cycle [C]. Kim K. S., Kim K. Y., Lee K. H., International Conference on Information Network. 2012
5. Design and implementation of an intelligent Web crawler for corporate data scraping [D]. Qin, Xinfeng. 2007
6. Integrating a Web-Based Self-Management Tool (Managing Joint Pain on the Web and Through Resources) for People With Osteoarthritis-Related Joint Pain With a Web-Based Social Network Support Tool (Generating Engagement in Network Involvement): Design Development and Early Evaluation [O]. Paul Clarkson, Ivaylo Vassilev, Anne Rogers. 2020
7. Design and Implementation of Commodity Information Collection System Based on Web Crawler [O]. 张树鑫. 2016
