首页> 外文学位 >Evaluation and optimization of turnaround time and cost of HPC applications on the cloud.
【24h】

Evaluation and optimization of turnaround time and cost of HPC applications on the cloud.

机译:评估和优化云上HPC应用程序的周转时间和成本。

获取原文
获取原文并翻译 | 示例

摘要

The popularity of Amazon's EC2 cloud platform has increased in commercial and scientific high-performance computing (HPC) applications domain in recent years. However, many HPC users consider dedicated high-performance clusters, typically found in large compute centers such as those in national laboratories, to be far superior to EC2 because of significant communication overhead of the latter. We find this view to be quite narrow and the proper metrics for comparing high-performance clusters to EC2 is turnaround time and cost..;In this work, we first compare the HPC-grade EC2 cluster to top-of-the-line HPC clusters based on turnaround time and total cost of execution. When measuring turnaround time, we include expected queue wait time on HPC clusters. Our results show that although as expected, standard HPC clusters are superior in raw performance, they suffer from potentially significant queue wait times. We show that EC2 clusters may produce better turnaround times due to typically lower wait queue times. To estimate cost, we developed a pricing model---relative to EC2's node-hour prices---to set node-hour prices for (currently free) HPC clusters. We observe that the cost-effectiveness of running an application on a cluster depends on raw performance and application scalability.;However, despite the potentially lower queue wait and turnaround times, the primary barrier to using clouds for many HPC users is the cost. Amazon EC2 provides a fixed-cost option (called on-demand) and a variable-cost, auction-based option (called the spot market). The spot market trades lower cost for potential interruptions that necessitate checkpointing; if the market price exceeds the bid price, a node is taken away from the user without warning. We explore techniques to maximize performance per dollar given a time constraint within which an application must complete. Specifically, we design and implement multiple techniques to reduce expected cost by exploiting redundancy in the EC2 spot market. We then design an adaptive algorithm that selects a scheduling algorithm and determines the bid price. We show that our adaptive algorithm executes programs up to 7x cheaper than using the on-demand market and up to 44% cheaper than the best non-redundant, spot-market algorithm. Finally, we extend our adaptive algorithm to exploit several opportunities for cost-savings on the EC2 spot market. First, we incorporate application scalability characteristics into our adaptive policy. We show that the adaptive algorithm informed with scalability characteristics of applications achieves up to 56% cost-savings compared to the expected cost for the base adaptive algorithm run at a fixed, user-defined scale. Second, we demonstrate potential for obtaining considerable free computation time on the spot market enabled by its hour-boundary pricing model.
机译:近年来,在商业和科学高性能计算(HPC)应用领域中,Amazon EC2云平台的普及程度有所提高。但是,许多HPC用户认为专用的高性能集群(通常在大型计算中心(如国家实验室的集群)中找到)比EC2优越得多,因为后者的通信开销很大。我们发现这种观点非常狭窄,将高性能集群与EC2进行比较的合适指标是周转时间和成本。.;在这项工作中,我们首先将HPC级EC2集群与顶级HPC进行了比较。基于周转时间和总执行成本的集群。在测量周转时间时,我们包括了HPC群集上的预期队列等待时间。我们的结果表明,尽管正如预期的那样,标准HPC群集在原始性能方面表现优异,但它们可能会遭受大量的队列等待时间。我们显示,由于通常减少等待队列时间,因此EC2群集可能会产生更好的周转时间。为了估算成本,我们开发了一种定价模型-相对于EC2的节点小时价格-来设置(当前免费的)HPC群集的节点小时价格。我们观察到在群集上运行应用程序的成本效益取决于原始性能和应用程序可伸缩性。但是,尽管等待和周转时间可能会减少,但许多HPC用户使用云的主要障碍是成本。 Amazon EC2提供固定成本选项(称为按需)和可变成本,基于拍卖的选项(称为现货市场)。现货市场以较低的成本交易可能需要检查点的潜在中断。如果市场价格超过买入价,则将节点从用户手中夺走,而不会发出警告。我们探索在给定时间限制的情况下最大化每美元性能的技术,在此时间内必须完成应用程序。具体来说,我们设计和实施多种技术,以通过利用EC2现货市场中的冗余来降低预期成本。然后,我们设计一种自适应算法,该算法选择调度算法并确定出价。我们证明,与使用按需市场相比,我们的自适应算法执行的程序便宜多达7倍,并且比最好的非冗余现货市场算法便宜多达44%。最后,我们扩展了自适应算法,以利用EC2现货市场上的多种节省成本的机会。首先,我们将应用程序可伸缩性特征整合到我们的自适应策略中。我们证明,与以固定的用户定义规模运行的基本自适应算法的预期成本相比,具有应用程序可伸缩性特征的自适应算法可节省多达56%的成本。其次,我们证明了其小时边界定价模型在现货市场上获得大量免费计算时间的潜力。

著录项

  • 作者

    Marathe, Aniruddha.;

  • 作者单位

    The University of Arizona.;

  • 授予单位 The University of Arizona.;
  • 学科 Computer Science.;Web Studies.
  • 学位 Ph.D.
  • 年度 2014
  • 页码 108 p.
  • 总页数 108
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号