首页> 外文学位 >Understanding and Improving Modern Web Traffic Caching.
【24h】

Understanding and Improving Modern Web Traffic Caching.

机译:了解和改进现代Web流量缓存。

获取原文
获取原文并翻译 | 示例

摘要

The WorldWide Web is one of the most popular and important Internet applications, and our daily lives heavily rely on it. Despite its importance, the current Web access is still limited for two reasons: (1) the Web has changed and grown significantly as social networking, video streaming, and file hosting sites have become popular, requiring more and more bandwidth, and (2) the need for Web access also has grown, and many users in bandwidth-limited environments, such as people in the developing world or mobile device users, still suffer from poor Web access.;There was a burst of research a decade ago aimed at understanding the nature of Web traffic and thus improving Web access, but unfortunately, it has dropped off just as the Web has changed significantly. As a result, we have little understanding of the underlying nature of today's Web traffic, and thus miss traffic optimization opportunities for improvingWeb access. To help improveWeb access, this dissertation attempts to fill the missing gap between previous research and today's Web.;For a better understanding of today'sWeb traffic, we first analyze five years (2006-2010) of real Web traffic from a globally-distributed proxy system, which captures the browsing behavior of over 70,000 users from 187 countries. Using this data set, we examine major changes in Web traffic characteristics that occurred during this period. We also develop a new Web page analysis technique that is better suited for modern Web page interactions. Using our analysis technique, we analyze various aspects of page-level changes, and present a simple Web traffic model that we develop based on our findings. Finally, we investigate the redundancy of this traffic, using both traditional object-level caching as well as content-based approaches that use the caching technique at the sub-object or packet level. Among many findings, we observe a huge potential benefit of the content-based caching approaches - the byte hit rate is almost twice as large as that of the traditional object-level caching approach.;Motivated by the possible benefits from content-based caching approaches, we also develop Wanax, a scalable and flexible wide-area network (WAN) accelerator that is designed for low-bandwidth and resource-limited developing world environments. It uses a novel multi-resolution chunking (MRC) scheme that provides high compression rates and high disk performance for a variety of content, while using much less memory than existing approaches. Wanax exploits the design of MRC to perform intelligent load shedding to maximize throughput even when running on resource-limited shared platforms. Finally, Wanax exploits mesh network environments, instead of just the star topologies common in enterprise branch offices. Equally importantly, the designs of Wanax can be applied to enterprise environments, providing the same benefits.
机译:万维网是最流行和最重要的Internet应用程序之一,我们的日常生活严重依赖它。尽管它很重要,但是由于两个原因,当前的Web访问仍然受到限制:(1)随着社交网络,视频流和文件托管站点的流行,Web的变化和发展显着,需要越来越多的带宽;(2)对Web访问的需求也在增长,并且带宽受限的环境中的许多用户(例如发展中国家的人或移动设备用户)仍然遭受较差的Web访问的困扰。十年前爆发了一系列研究,旨在了解Web流量的性质,从而改善了Web访问,但不幸的是,它已随着Web的巨大变化而下降。结果,我们对当今Web流量的基本本质了解甚少,因此错过了用于改进Web访问的流量优化机会。为了帮助改善Web访问,本论文试图填补以前的研究与当今Web之间缺少的空白。为了更好地了解当今的Web流量,我们首先分析了一个五年(2006-2010年)的全球分布的真实Web流量。代理系统,可捕获来自187个国家/地区的70,000多名用户的浏览行为。使用此数据集,我们检查了此期间发生的Web流量特征的主要变化。我们还开发了一种新的网页分析技术,该技术更适合现代网页交互。使用我们的分析技术,我们分析了页面级更改的各个方面,并提出了一个基于发现的简单Web流量模型。最后,我们使用传统的对象级缓存以及在子对象或数据包级使用缓存技术的基于内容的方法来研究此流量的冗余。在许多发现中,我们观察到基于内容的缓存方法具有巨大的潜在好处-字节命中率几乎是传统对象级缓存方法的两倍;;受基于内容的缓存方法可能带来的好处的驱使,我们还开发了Wanax,这是一种可扩展且灵活的广域网(WAN)加速器,旨在用于低带宽和资源有限的发展中国家环境。它使用新颖的多分辨率分块(MRC)方案,可为各种内容提供高压缩率和高磁盘性能,同时使用的内存比现有方法少得多。 Wanax利用MRC的设计来执行智能减载,即使在资源受限的共享平台上运行时也可以最大化吞吐量。最后,Wanax利用网状网络环境,而不仅仅是企业分支机构中常见的星形拓扑。同样重要的是,Wanax的设计可以应用于企业环境,具有相同的优势。

著录项

  • 作者

    Ihm, Sunghwan.;

  • 作者单位

    Princeton University.;

  • 授予单位 Princeton University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 128 p.
  • 总页数 128
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号