A framework for simulating large scale cloud infrastructures
Journal: Future Generation Computer Systems

Abstract

Cloud infrastructures are continuously growing in size as more cloud nodes are added to existing hyper-scale infrastructures. These infrastructures are also becoming heterogeneous, as different types of accelerators are added to increase performance per watt for certain types of applications and to allow various HPC workloads to migrate to Cloud environments. The introduction of diverse workloads migrating to the Cloud, along with the increasing volume of incoming tasks, results in network congestion, underutilization and resource fragmentation. Simulators are used to analyze, study and possibly improve Cloud environments. However, existing Cloud simulation tools lack the ability to handle heterogeneous resources and tasks that span multiple Cloud nodes. Moreover, they are mostly sequential and cannot scale to large numbers of Cloud nodes. Furthermore, they do not support over-commitment, which is a common practice in real-world Cloud environments. A framework is proposed for simulating large numbers of heterogeneous cloud nodes organized in Cells and executing large numbers of HPC tasks. The framework is inherently parallel and designed for hybrid distributed memory parallel systems, supporting CPU, memory and network over-commitment. The simulation framework is based on a time-advancing loop, which allows the granularity of the simulator to be changed dynamically and minimizes memory requirements, since only data related to the current time-step is stored. Moreover, a latency model for the currency of data in the Gateway Service and Broker is also supported. Implementation details are given, along with a discussion of the extensibility of the framework. Numerical results for simulating large numbers of heterogeneous resources and incoming tasks are also presented.

Highlights

  • Hybrid parallel cloud simulation framework, able to handle millions of resources.
  • The framework is able to handle heterogeneous resources, including accelerators.
  • The framework supports execution of tasks spanning multiple Cloud nodes.
  • Extensible distributed design, able to include new models and components.
  • Parallel time-advancing loop design to reduce memory requirements.
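
The abstract gives no code, so the following is only an illustrative sketch of two ideas it names: a time-advancing loop that keeps only the current time-step's state, and CPU over-commitment. It is not the authors' implementation. All names (`Task`, `Node`, `simulate`, the `overcommit` ratio) and the first-fit placement rule are assumptions made for the example, and the sketch is sequential and single-process, whereas the framework described above is parallel and also covers memory and network over-commitment.

```python
from dataclasses import dataclass, field


@dataclass
class Task:
    arrival: float          # time at which the task enters the system
    duration: float         # run time once the task has been placed
    cpus: int               # requested virtual CPUs
    remaining: float = 0.0  # run time still left (set when placed)


@dataclass
class Node:
    physical_cpus: int
    overcommit: float = 1.0  # hypothetical ratio: virtual CPUs per physical CPU
    running: list = field(default_factory=list)

    def try_place(self, task: Task) -> bool:
        """Admit the task if it fits within the over-committed CPU capacity."""
        used = sum(t.cpus for t in self.running)
        if used + task.cpus <= self.physical_cpus * self.overcommit:
            task.remaining = task.duration
            self.running.append(task)
            return True
        return False


def simulate(nodes, tasks, dt, steps):
    """Advance time in fixed steps of size dt; no per-step history is kept.

    Returns (completed, unplaced), where unplaced counts tasks that were
    never admitted to a node by the end of the run.
    """
    pending = sorted(tasks, key=lambda t: t.arrival, reverse=True)
    queue = []
    completed = 0
    for k in range(steps):
        now = k * dt
        # 1. Move tasks that have arrived by `now` into the waiting queue.
        while pending and pending[-1].arrival <= now:
            queue.append(pending.pop())
        # 2. First-fit placement: keep only the tasks no node could admit.
        queue = [t for t in queue if not any(n.try_place(t) for n in nodes)]
        # 3. Advance running tasks by dt and retire the finished ones.
        for n in nodes:
            for t in n.running:
                t.remaining -= dt
            completed += sum(1 for t in n.running if t.remaining <= 0)
            n.running = [t for t in n.running if t.remaining > 0]
    return completed, len(queue) + len(pending)
```

In this toy model, raising `overcommit` from 1.0 to 2.0 on a 4-CPU node lets four 2-CPU tasks run concurrently instead of two. Changing `dt` between runs is the simplest stand-in for the adjustable granularity the abstract mentions.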
