首页> 外文学位 >Achieving high performance on extremely large parallel machines: Performance prediction and load balancing.
【24h】

Achieving high performance on extremely large parallel machines: Performance prediction and load balancing.

机译:在超大型并行机上实现高性能:性能预测和负载平衡。

获取原文
获取原文并翻译 | 示例

摘要

Parallel machines with an extremely large number of processors are now in operation. For example, the IBM BlueGene/L machine with 128K processors is currently being deployed. It is going to be a significant challenge for application developers to write parallel programs in order to exploit the enormous compute power available and manually scale their applications on such machines. Solving these problems involves finding suitable parallel programming models for such machines and addressing issues like load imbalance. This thesis explores processor virtualization in Charm++ programming model and employing migratable objects for programming petaflops class machines supported by parallel emulation for algorithm validation, parallel simulation for performance prediction, and using new kinds of automatic load balancing strategies to substantially address many of these challenges for programming very large machines.; It is important to understand the performance of parallel applications on very large parallel machines. This thesis explores Parallel Discrete Event Simulation techniques to simulate parallel applications and predict their performance. We present a novel optimistic synchronization protocol which exploits the inherent determinacy in parallel applications to effectively reduce the synchronization overhead.; Load balance problem presents significant challenges to applications to achieve scalability on very large machines. We study load balancing techniques and develop a spectrum of load balancing strategies motivated by several real-world applications. We optimize our load balancing strategies in multiple dimensions of criteria such as communication-aware load balancing, sub-step load balancing, and computation phase-aware load balancing. We have successfully scaled NAMD (a classical molecular dynamics application) to 1TF of peak performance on 3000 processors of PSC Lemieux, using the load balancing framework presented in this thesis.; We further motivate the need for next generation load balancing strategies for petaflops class machines. We explore a novel design of a scalable hierarchical load balancing scheme, which incorporates an explicit memory cost control function to make it easy to adapt to extremely large machines with small memory footprint. This hierarchical load balancing scheme builds load data from instrumenting an application automatically at run-time on both computation and communication pattern. The load balancing strategy takes application communication pattern into account explicitly.
机译:具有大量处理器的并行计算机现在正在运行。例如,当前正在部署具有128K处理器的IBM BlueGene / L计算机。对于应用程序开发人员而言,编写并行程序以利用可用的巨大计算能力并在此类计算机上手动扩展其应用程序将是一个巨大的挑战。解决这些问题涉及为此类机器找到合适的并行编程模型,并解决诸如负载不平衡之类的问题。本文探讨了Charm ++编程模型中的处理器虚拟化,并采用可移植对象对petaflops类机器进行编程,并行仿真支持算法验证,并行仿真支持性能预测,并使用新型的自动负载平衡策略来解决编程中的许多挑战很大的机器。重要的是要了解大型并行机器上并行应用程序的性能。本文探讨了并行离散事件模拟技术,以模拟并行应用程序并预​​测其性能。我们提出了一种新颖的乐观同步协议,该协议利用并行应用程序中的固有确定性来有效减少同步开销。负载平衡问题对应用程序提出了巨大挑战,以在大型计算机上实现可伸缩性。我们研究负载平衡技术,并开发由多种实际应用激发的负载平衡策略。我们在多个维度的标准中优化我们的负载平衡策略,例如,可感知通信的负载平衡,子步骤负载平衡和计算阶段感知的负载平衡。我们已经使用本文提出的负载平衡框架成功地将NAMD(经典分子动力学应用程序)的峰值性能扩展到1 TF,在3000个PSC Lemieux处理器上。我们进一步激发了对petaflops级机器的下一代负载平衡策略的需求。我们探索了可伸缩的分层负载平衡方案的新颖设计,该方案结合了显式的内存成本控制功能,以使其易于适应具有较小内存占用空间的超大型计算机。这种分层的负载平衡方案通过在运行时自动在计算和通信模式上检测应用程序来构建负载数据。负载平衡策略明确考虑了应用程序通信模式。

著录项

  • 作者

    Zheng, Gengbin.;

  • 作者单位

    University of Illinois at Urbana-Champaign.;

  • 授予单位 University of Illinois at Urbana-Champaign.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 163 p.
  • 总页数 163
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号