首页> 外文会议>International conference on web engineering >Hidden-Web Induced by Client-Side Scripting: An Empirical Study
【24h】

Hidden-Web Induced by Client-Side Scripting: An Empirical Study

机译:客户端脚本引发的隐藏Web:一项实证研究

获取原文

摘要

Client-side JavaScript is increasingly used for enhancing web application functionality, interactivity, and responsiveness. Through the execution of JavaScript code in browsers, the DOM tree representing a webpage at runtime, can be incrementally updated without requiring a URL change. This dynamically updated content is hidden from general search engines. In this paper, we present the first empirical study on measuring and characterizing the hidden-web induced as a result of client-side JavaScript execution. Our study reveals that this type of hidden-web content is prevalent in online web applications today: from the 500 websites we analyzed, 95% contain client-side hidden-web content; On those websites that contain client-side hidden-web content, (1) on average, 62% of the web states are hidden, (2) per hidden state, there is an average of 19 kilobytes of data that is hidden from which 0.6 kilobytes contain textual content, (3) the DIV element is the most common clickable element used (61%) to initiate this type of hidden-web state transition, and (4) on average 25 minutes is required to dynamically crawl 50 DOM states. Further, our study indicates that there is a correlation between DOM tree size and hidden-web content, but no correlation exists between the amount of JavaScript code and client-side hidden-web.
机译:客户端JavaScript越来越多地用于增强Web应用程序的功能,交互性和响应能力。通过在浏览器中执行JavaScript代码,可以在不更改URL的情况下逐步更新表示运行时网页的DOM树。动态更新的内容对一般搜索引擎而言是隐藏的。在本文中,我们提供了第一项关于测量和表征由客户端JavaScript执行导致的隐藏Web的实证研究。我们的研究表明,这种类型的隐藏Web内容在当今的在线Web应用程序中很普遍:在我们分析的500个网站中,有95%包含客户端隐藏的Web内容;在那些包含客户端隐藏的Web内容的网站上,平均(1),62%的Web状态被隐藏,(2)每个隐藏状态,平均有19 KB的数据被隐藏,其中0.6千字节包含文本内容,(3)DIV元素是最常用的可点击元素(61%),用于启动这种类型的隐藏网络状态转换,并且(4)平均需要25分钟才能动态抓取50个DOM状态。此外,我们的研究表明DOM树大小与隐藏Web内容之间存在相关性,但JavaScript代码量和客户端隐藏Web量之间不存在相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号