首页> 外文会议>Workshop on Component-Based High Performance Computing >The content and access dynamics of a busy Web site
【24h】

The content and access dynamics of a busy Web site

机译:繁忙网站的内容和访问动态

获取原文

摘要

In this paper, we study the dynamics of the MSNBC news site, one of the busiest Web sites in the Internet today. Unlike many other efforts that have analyzed client accesses as seen by proxies, we focus on the server end. We analyze the dynamics of both the server content and client accesses made to the server. The former considers the content creation and modification process while the latter considers page popularity and locality in client accesses. Some of our key results are: (a) files tend to change little when they are modified, (b) a small set of files tends to get modified repeatedly, (c) file popularity follows a Zipf-like distribution with a parameter &agr that is much larger than reported in previous, proxy-based studies, and (d) there is significant temporal stability in file popularity but not much stability in the domains from which clients access the popular content. We discuss the implications of these findings for techniques such as Web caching (including cache consistency algorithms), and prefetching or server-based ``push'' of Web content.

机译:

在本文中,我们研究了MSNBC新闻站点的动态,MSNBC新闻站点是当今Internet上最繁忙的网站之一。与分析代理访问客户端访问的许多其他工作不同,我们专注于服务器端。我们分析服务器内容和对服务器进行的客户端访问的动态。前者考虑内容的创建和修改过程,而后者考虑客户端访问中的页面受欢迎程度和本地性。我们的一些主要结果是:(a)文件在被修改时往往变化不大,(b)一小组文件往往会被反复修改,(c)文件的流行遵循类似Zipf的分布,且参数&agr为比以前基于代理的研究报告的要大得多,并且(d)文件受欢迎程度在时间上具有显着的稳定性,但在客户端访问受欢迎内容的域中没有太大的稳定性。我们讨论了这些发现对Web缓存(包括缓存一致性算法)以及Web内容的预取或基于服务器的``推送''等技术的含义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号