首页> 外文OA文献 >New contributions for modeling and simulating high performance computing applications on parallel and distributed architectures
【2h】

New contributions for modeling and simulating high performance computing applications on parallel and distributed architectures

机译:在并行和分布式体系结构上建模和模拟高性能计算应用程序的新贡献

摘要

In this thesis we propose a new simulation platform specifically designed for modeling parallel and distributed architectures, which consists on integrating the model of the four basic systems into a single simulation platform. Those systems consist of storage system, memory system, processing system and network system. The main characteristics of this platform are flexibility, to embrace the widest range of possible designs; scalability, to check the limits of extending the architecture designs; and the necessary trade-offs between the execution time and the accuracy obtained. This simulation platform is aimed to model both existent and new designs of HPC architectures and applications. Then, depending on the user's requirements, the model can be focused on a set of the basic systems, or by the contrary on the complete system. Therefore, a complete distributed system can be modeled by integrating those basic systems in the model, each one with the corresponding level of detail, which provides a high level of flexibility. Moreover, it provides a good compromise between accuracy and performance, and flexibility provided for building a wide range of architectures with different configurations. A validation process of the proposed simulation platform has been fulfilled by comparing the results obtained in real architectures with those obtained in the analogous simulated environments. Furthermore, in order to evaluate and analyze how evolve both scalability and bottlenecks existent on a typical HPC multi-core architecture using different configurations, a set of experiments have been achieved. Basically those experiments consist on executing the two application models (HPC and checkpointing applications) in several HPC architectures. Finally, performance results of the simulation itself for executing the corresponding experiments have been achieved. The main purpose of this process is to calculate both the amount of time and memory needed for executing a specific simulation, depending of the size of the environment to be modeled, and the hardware resources available for executing each simulation. ----------------------------------------------------------------------------------------------------------------------------------------------------------
机译:在本文中,我们提出了一个专门为并行和分布式体系结构建模而设计的新仿真平台,该平台将四个基本系统的模型集成到一个仿真平台中。这些系统由存储系统,内存系统,处理系统和网络系统组成。该平台的主要特点是灵活性,可以涵盖最广泛的可能设计。可扩展性,以检查扩展架构设计的限制;以及执行时间和获得的精度之间的必要权衡。该仿真平台旨在为HPC体系结构和应用程序的现有设计和新设计建模。然后,根据用户的要求,模型可以集中在一组基本系统上,或者相反,集中在整个系统上。因此,可以通过将模型中的那些基本系统集成到一个完整的分布式系统中来建模,每个基本系统具有相应的详细程度,从而提供了高度的灵活性。此外,它在准确性和性能以及灵活性之间提供了很好的折衷,从而为构建具有不同配置的各种体系结构提供了灵活性。通过将在实际架构中获得的结果与在类似的模拟环境中获得的结果进行比较,已完成了所提出的仿真平台的验证过程。此外,为了评估和分析使用不同配置的典型HPC多核架构上可扩展性和瓶颈的发展方式,已经完成了一组实验。基本上,这些实验包括执行几种HPC架构中的两个应用程序模型(HPC和检查点应用程序)。最终,获得了用于执行相应实验的仿真本身的性能结果。此过程的主要目的是根据要建模的环境的大小以及可用于执行每个模拟的硬件资源,来计算执行特定模拟所需的时间和内存。 -------------------------------------------------- -------------------------------------------------- -------------------------------------------------- ----

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号