首页> 外文会议>IEEE International Conference on Big Data >Trilogy: Data placement to improve performance and robustness of cloud computing
【24h】

Trilogy: Data placement to improve performance and robustness of cloud computing

机译:三部曲:数据放置以提高云计算的性能和健壮性

获取原文

摘要

Infrastructure as a Service, one of the most disruptive aspects of cloud computing, enables configuring a cluster for each application for each workload. When the workload changes, a cluster will be either underutilized (wasting resources) or unable to meet demand (incurring opportunity costs). Consequently, efficient cluster resizing requires proper data replication and placement. Our work reveals that coarse-grain, workload-aware replication addresses over-utilization but cannot resolve under-utilization. With fine-grain partitioning of the dataset, data replication can reduce both under- and over-utilization. In our empirical studies, compared to a näive uniform data replication a coarse-grain workload-aware replication increases throughput by 81% on a highly-skewed workload. A fine-grain scheme further reaches 166% increase. Furthermore, a surprisingly small increase in granularity is sufficient to obtain most benefits. Evaluations also show that maximizing the number of unique partitions per node increases robustness to tolerate workload deviation while minimizing this number reduces storage footprint.
机译:基础架构即服务是云计算最具破坏性的方面之一,可为每个应用程序为每个工作负载配置群集。当工作负载发生变化时,集群将得到充分利用(浪费资源)或无法满足需求(导致机会成本)。因此,有效的群集大小调整需要正确的数据复制和放置。我们的工作表明,粗粒度,可感知工作负载的复制可以解决过度利用的问题,但不能解决利用不足的问题。通过对数据集进行细粒度分区,数据复制可以减少利用率不足和过度利用的情况。在我们的经验研究中,与天真的统一数据复制相比,粗粒度的工作负载感知复制在高度倾斜的工作负载上将吞吐量提高了81%。细粒度方案进一步达到166%的增长。此外,粒度出乎意料的小幅增加足以获得最大的收益。评估还显示,最大化每个节点的唯一分区数可以提高鲁棒性以容忍工作负载偏差,而最小化此数目则可以减少存储占用空间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号