Web Crawler for Event-Driven Crawling of AJAX-Based Web Applications

Abstract

This paper describes a novel technique for crawling AJAX-based applications through "event-driven" crawling in web browsers. The algorithm uses the browser context to analyse the DOM: it scans the DOM tree, detects elements that are capable of changing the state, triggers events on those elements, and extracts the dynamic DOM content. An AJAX web application is used as a running example to explain the approach. The authors implement the concepts and algorithms discussed in the paper in a tool, and report a number of empirical studies in which they apply the approach to several representative AJAX applications. The results show that their method performs better, often with a faster rate of state discovery, and that "event-driven" crawling can effectively and accurately crawl dynamic content from AJAX-based applications.
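
The loop outlined in the abstract (scan the DOM tree, detect elements that can change the state, trigger events on them, extract the resulting DOM) can be sketched as follows. This is only an illustrative sketch, not the authors' tool: it assumes a toy in-memory DOM instead of a real browser, and the names `Node`, `make_app`, `candidates`, `replay` and `crawl` are hypothetical. It also assumes that a state is identified by its serialized DOM and that a state is restored by reloading the application and replaying the events that led to it.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterator, List, Set, Tuple

Action = Tuple[int, str]  # (node index in document order, event name)


@dataclass
class Node:
    """Toy DOM node; `handlers` maps an event name to a zero-argument callback."""
    tag: str
    text: str = ""
    children: List["Node"] = field(default_factory=list)
    handlers: Dict[str, Callable[[], None]] = field(default_factory=dict)


def walk(node: Node) -> Iterator[Node]:
    """Scan the DOM tree in document order."""
    yield node
    for child in node.children:
        yield from walk(child)


def serialize(node: Node) -> str:
    """Extract the DOM content as a string; used here as the state identity."""
    inner = "".join(serialize(child) for child in node.children)
    return f"<{node.tag}>{node.text}{inner}</{node.tag}>"


def candidates(root: Node) -> List[Action]:
    """Detect elements that may change the state: those with event handlers."""
    return [(i, ev) for i, n in enumerate(walk(root)) for ev in n.handlers]


def fire(root: Node, action: Action) -> None:
    """Trigger one event on one element."""
    index, event = action
    list(walk(root))[index].handlers[event]()


def replay(make_app: Callable[[], Node], path: Tuple[Action, ...]) -> Node:
    """Reload the application and replay the events that lead to a state."""
    root = make_app()
    for action in path:
        fire(root, action)
    return root


def crawl(make_app: Callable[[], Node], max_states: int = 50):
    """Breadth-first exploration of the DOM states reachable through events."""
    seen: Set[str] = {serialize(make_app())}
    edges: List[Tuple[str, Action, str]] = []
    queue = deque([()])  # each entry is the event path that reaches a state
    while queue and len(seen) < max_states:
        path = queue.popleft()
        root = replay(make_app, path)
        before = serialize(root)
        for action in candidates(root):
            root = replay(make_app, path)  # restore the state before each event
            fire(root, action)
            after = serialize(root)
            if after != before:  # the event changed the DOM
                edges.append((before, action, after))
                if after not in seen:
                    seen.add(after)
                    queue.append(path + (action,))
    return seen, edges


def make_app() -> Node:
    """Hypothetical page: two links that rewrite a content pane."""
    pane = Node("div", "home")

    def show(text: str) -> Callable[[], None]:
        def handler() -> None:
            pane.text = text
        return handler

    news = Node("a", "News", handlers={"click": show("news loaded")})
    about = Node("a", "About", handlers={"click": show("about loaded")})
    return Node("body", children=[news, about, pane])


if __name__ == "__main__":
    states, edges = crawl(make_app)
    print(len(states), "states,", len(edges), "transitions")
```

On this toy page the crawl finds three DOM states (the initial pane plus one per link) and four transitions between them. In a real browser, the same loop would have to wait for asynchronous requests to finish before comparing DOM snapshots, which this sketch does not model.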

Bibliographic details

Source
International conference on emerging technologies for information systems, computing, and management | 2013 | pp. 191-200 | 10 pages
Authors
Guoshi Wu; Fanfan Liu
Original format
PDF
Keywords
AJAX; Event-driven crawling; Web crawler

