首页> 外文OA文献 >Achieving High Performance on Extremely Large Parallel Machines: Performance Prediction and Load Balancing
【2h】

Achieving High Performance on Extremely Large Parallel Machines: Performance Prediction and Load Balancing

机译:在超大型并行机上实现高性能:性能预测和负载均衡

摘要

Parallel machines with an extremely large number of processors (at least tens of thousands processors) are now in operation. For example, the IBM BlueGene/L machine with 128K processors is currently being deployed. It is going to be a significant challenge for application developers to write parallel programs in order to exploit the enormous compute power available and manually scale their applications on such machines. Solving these problems involves finding suitable parallel programming models for such machines and addressing issues like load imbalance. In this thesis, we explore Charm++ programming model and its migratable objects for programming such machines and dynamic load balancing techniques to help parallel applications to easily scale on a large number of processors. We also present a parallel simulator that is capable of predicting parallel performance to help analysis and tuning of the parallel performance and facilitate the development of new load balancing techniques, even before such machines are built.We evaluate the idea of virtualization and its usefulness in helping a programmer to write applications with high degree of parallelism. We demonstrate it by developing several mini-applications with million-way parallelism. We show that Charm++ and AMPI (an extension to MPI) with migratable objects and support for load balancing are suitable programming model for programming very large machines.It is important to understand the performance of parallel applications on very large parallel machines. This thesis explores Parallel Discrete Event Simulation (PDES) techniques with an optimistic synchronization protocol to simulate parallel applications running on a very large number of processors. We optimize the synchronization protocol by exploiting the inherent determinacy that is normally found in parallel applications to reduce the synchronization overhead significantly.
机译:具有大量处理器(至少数万个处理器)的并行计算机现在正在运行。例如,当前正在部署具有128K处理器的IBM BlueGene / L计算机。对于应用程序开发人员而言,编写并行程序以利用可用的巨大计算能力并在此类计算机上手动扩展其应用程序将是一个巨大的挑战。解决这些问题涉及为此类机器找到合适的并行编程模型,并解决诸如负载不平衡之类的问题。在本文中,我们探索了Charm ++编程模型及其可移植对象,以对此类机器和动态负载平衡技术进行编程,以帮助并行应用程序轻松地在大量处理器上进行扩展。我们还提供了一种并行仿真器,该仿真器能够预测并行性能,以帮助分析和调整并行性能,甚至在构建此类负载均衡器之前就促进了新的负载平衡技术的开发。程序员编写高度并行的应用程序。我们通过开发具有百万程并行性的几个小型应用程序来演示它。我们展示了具有可迁移对象的Charm ++和AMPI(对MPI的扩展)以及对负载平衡的支持,是对大型计算机进行编程的合适编程模型。了解大型并行计算机上并行应用程序的性能非常重要。本文探讨了采用乐观同步协议的并行离散事件模拟(PDES)技术,以模拟运行在大量处理器上的并行应用程序。我们利用并行应用程序中通常存在的固有确定性来优化同步协议,以显着减少同步开销。

著录项

  • 作者

    Zheng Gengbin;

  • 作者单位
  • 年度 2005
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号