首页> 外文会议>IEEE International Symposium on Parallel Distributed Processing;IPDPS 2009 >Resource monitoring and management with OVIS to enable HPC in cloud computing environments
【24h】

Resource monitoring and management with OVIS to enable HPC in cloud computing environments

机译:使用OVIS进行资源监视和管理,以在云计算环境中启用HPC

获取原文

摘要

Using the cloud computing paradigm, a host of companies promise to make huge compute resources available to users on a pay-as-you-go basis. These resources can be configured on the fly to provide the hardware and operating system of choice to the customer on a large scale. While the current target market for these resources in the commercial space is Web development/hosting, this model has the lure of savings of ownership, operation, and maintenance costs, and thus sounds like an attractive solution for people who currently invest millions to hundreds of millions of dollars annually on high performance computing (HPC) platforms in order to support large-scale scientific simulation codes. Given the current interconnect bandwidth and topologies utilized in these commercial offerings, however, the only current viable market in HPC would be small-memory-footprint embarrassingly parallel or loosely coupled applications, which inherently require little to no inter-processor communication. While providing the appropriate resources (bandwidth, latency, memory, etc.) for the HPC community would increase the potential to enable HPC in cloud environments, this would not address the need for scalability and reliability, crucial to HPC applications. Providing for these needs is particularly difficult in commercial cloud offerings where the number of virtual resources can far outstrip the number of physical resources, the resources are shared among many users, and the resources may be heterogeneous. Advanced resource monitoring, analysis, and configuration tools can help address these issues, since they bring the ability to dynamically provide and respond to information about the platform and application state and would enable more appropriate, efficient, and flexible use of the resources key to enabling HPC. Additionally such tools could be of benefit to non-HPC cloud providers, users, and applications by providing more efficient resource utilization in general.
机译:使用云计算范例,许多公司承诺按使用量付费的方式向用户提供大量的计算资源。这些资源可以动态配置,以向客户大规模提供所选的硬件和操作系统。尽管目前在商业领域中这些资源的目标市场是Web开发/托管,但这种模型具有节省拥有,运营和维护成本的诱惑,因此对于当前投资数百万至数百亿美元的人们来说,这似乎是一个有吸引力的解决方案每年在高性能计算(HPC)平台上花费数百万美元,以支持大规模的科学仿真代码。但是,考虑到这些商业产品中使用的当前互连带宽和拓扑,HPC中唯一当前可行的市场将是小内存占用率,令人尴尬的并行或松耦合应用程序,这些应用程序本质上几乎不需要处理器间通信。虽然为HPC社区提供适当的资源(带宽,延迟,内存等)将增加在云环境中启用HPC的潜力,但这不能满足对HPC应用程序至关重要的可伸缩性和可靠性的需求。在虚拟资源数量远远超过物理资源数量,资源在许多用户之间共享并且资源可能是异构的商业云产品中,满足这些需求特别困难。先进的资源监视,分析和配置工具可以帮助解决这些问题,因为它们带来了动态提供和响应有关平台和应用程序状态的信息的能力,并且将使资源的更适当,高效和灵活使用成为可能。 HPC。此外,此类工具通常可以通过提供更有效的资源利用来使非HPC云提供商,用户和应用程序受益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号