首页> 外文会议>2011 International Conference for High Performance Computing, Networking, Storage and Analysis >Cloud versus in-house cluster: Evaluating Amazon cluster compute instances for running MPI applications
【24h】

Cloud versus in-house cluster: Evaluating Amazon cluster compute instances for running MPI applications

机译:云与内部集群:评估运行MPI应用程序的Amazon集群计算实例

获取原文

摘要

The emergence of cloud services brings new possibilities for constructing and using HPC platforms. However, while cloud services provide the flexibility and convenience of customized, pay-as-you-go parallel computing, multiple previous studies in the past three years have indicated that cloud-based clusters need a significant performance boost to become a competitive choice, especially for tightly coupled parallel applications. In this work, we examine the feasibility of running HPC applications in clouds. This study distinguishes itself from existing investigations in several ways: 1) We carry out a comprehensive examination of issues relevant to the HPC community, including performance, cost, user experience, and range of user activities. 2) We compare an Amazon EC2-based platform built upon its newly available HPC-oriented virtual machines with typical local cluster and supercomputer options, using benchmarks and applications with scale and problem size unprecedented in previous cloud HPC studies. 3) We perform detailed performance and scalability analysis to locate the chief limiting factors of the state-of-the-art cloud based clusters. 4) We present a case study on the impact of per-application parallel I/O system configuration uniquely enabled by cloud services. Our results reveal that though the scalability of EC2-based virtual clusters still lags behind traditional HPC alternatives, they are rapidly gaining in overall performance and cost-effectiveness, making them feasible candidates for performing tightly coupled scientific computing. In addition, our detailed benchmarking and profiling discloses and analyzes several problems regarding the performance and performance stability on EC2.
机译:云服务的出现为构建和使用HPC平台带来了新的可能性。但是,尽管云服务提供了定制的按需付费并行计算的灵活性和便利性,但过去三年中的多项先前研究表明,基于云的集群需要显着的性能提升才能成为竞争性选择,尤其是适用于紧密耦合的并行应用。在这项工作中,我们研究了在云中运行HPC应用程序的可行性。本研究通过以下几种方式将其与现有调查区分开:1)我们对与HPC社区相关的问题进行了全面检查,包括性能,成本,用户体验和用户活动范围。 2)我们将基准测试和应用程序与以前的云HPC研究中前所未有的规模和问题规模进行了比较,将基于Amazon EC2的平台基于其新可用的面向HPC的虚拟机与典型的本地集群和超级计算机选项构建。 3)我们执行详细的性能和可伸缩性分析,以找到基于最新云的集群的主要限制因素。 4)我们提供一个案例研究,说明云服务唯一启用的每个应用程序并行I / O系统配置的影响。我们的结果表明,尽管基于EC2的虚拟集群的可扩展性仍落后于传统的HPC替代方案,但它们在整体性能和成本效益方面正在迅速获得优势,使其成为进行紧密耦合的科学计算的可行候选者。此外,我们详细的基准测试和性能分析揭示并分析了有关EC2的性能和性能稳定性的若干问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号