首页> 外文学位 >Optimizing response time for distributed applications in public clouds.
【24h】

Optimizing response time for distributed applications in public clouds.

机译:优化公共云中分布式应用程序的响应时间。

获取原文
获取原文并翻译 | 示例

摘要

An increasing number of distributed data-driven applications are moving into public clouds. By sharing resources and operating at large scale, public clouds promise higher utilization and lower costs than private clusters. Also, flexible resource allocation and billing methods offered by public clouds enable tenants to control response time or time-to-solution of their applications.;To achieve high utilization, however, cloud providers inevitably place virtual machine instances non-contiguously, i.e., instances of a given application may end up in physically distant machines in the cloud. This allocation strategy leads to significant heterogeneity in average network latency between instances. Also, virtualization and the shared use of network resources between tenants results in network latency jitter. We observe that network latency heterogeneity and jitter in the cloud can greatly increase the time required for communication in these distributed data-driven applications, which leads to significantly worse response time.;To improve response time under latency jitter, we propose a general parallel framework which exposes a high-level, data-centric programming model. We design a jitter-tolerant runtime that exploits this programming model to absorb latency spikes transparently by (1) carefully scheduling computation and (2) replicating data and computation. To improve response time with heterogeneous mean latency, we present ClouDiA, a general deployment advisor that selects application node deployments minimizing either (1) the largest latency between application nodes, or (2) the longest critical path among all application nodes.;We also describe how to effectively control response time for interactive data analytics in public clouds. We introduce Smart, the first elastic cloud resource manager for in-memory interactive data analytics. Smart enables control of the speed of queries by letting users specify the number of compute units per GB of data processed, and quickly reacts to speed changes by adjusting the amount of resources allocated to the user. We then describe SmartShare, an extension of Smart that can serve multiple data scientists simultaneously to obtain additional cost savings without sacrificing query performance guarantees. Taking advantage of the workload characteristics of interactive data analysis, such as think time and overlap between datasets, we are able to further improve resource utilization and reduce cost.
机译:越来越多的分布式数据驱动的应用程序正在迁移到公共云中。通过共享资源并进行大规模操作,公有云比私有集群具有更高的利用率和更低的成本。此外,公共云提供的灵活资源分配和计费方法使租户可以控制其应用程序的响应时间或解决时间。然而,为了实现高利用率,云提供商不可避免地会不连续地放置虚拟机实例,即实例给定应用程序的最终结果可能会出现在云中物理上距离遥远的机器中。这种分配策略导致实例之间的平均网络延迟显着不同。此外,虚拟化和租户之间网络资源的共享使用也会导致网络延迟抖动。我们观察到云中的网络延迟异质性和抖动会大大增加这些分布式数据驱动的应用程序中通信所需的时间,从而导致响应时间显着变差。为了提高延迟抖动下的响应时间,我们提出了一个通用的并行框架公开了一个高级的,以数据为中心的编程模型。我们设计了一个容忍抖动的运行时,该运行时可利用此编程模型来透明地吸收延迟尖峰,方法是:(1)仔细安排计算时间,以及(2)复制数据和计算。为了提高异构平均等待时间的响应时间,我们介绍了通用部署顾问ClouDiA,它选择应用程序节点部署以最小化(1)应用程序节点之间的最大延迟,或(2)所有应用程序节点之间的最长关键路径。描述如何有效控制公共云中交互式数据分析的响应时间。我们介绍Smart,第一个用于内存中交互式数据分析的弹性云资源管理器。 Smart通过让用户指定处理的每GB数据的计算单位数来控制查询速度,并通过调整分配给用户的资源量来快速响应速度变化。然后,我们介绍SmartShare,它是Smart的扩展,可以同时为多个数据科学家提供服务,从而在不牺牲查询性能保证的情况下节省更多成本。利用交互式数据分析的工作负载特征,例如思考时间和数据集之间的重叠,我们能够进一步提高资源利用率并降低成本。

著录项

  • 作者

    Zou, Tao.;

  • 作者单位

    Cornell University.;

  • 授予单位 Cornell University.;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2015
  • 页码 185 p.
  • 总页数 185
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号