首页> 外文学位 >Achieving high performance on extremely large parallel machines: Performance prediction and load balancing.

【24h】

Achieving high performance on extremely large parallel machines: Performance prediction and load balancing.

机译：在超大型并行机上实现高性能：性能预测和负载平衡。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Parallel machines with an extremely large number of processors are now in operation. For example, the IBM BlueGene/L machine with 128K processors is currently being deployed. It is going to be a significant challenge for application developers to write parallel programs in order to exploit the enormous compute power available and manually scale their applications on such machines. Solving these problems involves finding suitable parallel programming models for such machines and addressing issues like load imbalance. This thesis explores processor virtualization in Charm++ programming model and employing migratable objects for programming petaflops class machines supported by parallel emulation for algorithm validation, parallel simulation for performance prediction, and using new kinds of automatic load balancing strategies to substantially address many of these challenges for programming very large machines.; It is important to understand the performance of parallel applications on very large parallel machines. This thesis explores Parallel Discrete Event Simulation techniques to simulate parallel applications and predict their performance. We present a novel optimistic synchronization protocol which exploits the inherent determinacy in parallel applications to effectively reduce the synchronization overhead.; Load balance problem presents significant challenges to applications to achieve scalability on very large machines. We study load balancing techniques and develop a spectrum of load balancing strategies motivated by several real-world applications. We optimize our load balancing strategies in multiple dimensions of criteria such as communication-aware load balancing, sub-step load balancing, and computation phase-aware load balancing. We have successfully scaled NAMD (a classical molecular dynamics application) to 1TF of peak performance on 3000 processors of PSC Lemieux, using the load balancing framework presented in this thesis.; We further motivate the need for next generation load balancing strategies for petaflops class machines. We explore a novel design of a scalable hierarchical load balancing scheme, which incorporates an explicit memory cost control function to make it easy to adapt to extremely large machines with small memory footprint. This hierarchical load balancing scheme builds load data from instrumenting an application automatically at run-time on both computation and communication pattern. The load balancing strategy takes application communication pattern into account explicitly.

机译：具有大量处理器的并行计算机现在正在运行。例如，当前正在部署具有128K处理器的IBM BlueGene / L计算机。对于应用程序开发人员而言，编写并行程序以利用可用的巨大计算能力并在此类计算机上手动扩展其应用程序将是一个巨大的挑战。解决这些问题涉及为此类机器找到合适的并行编程模型，并解决诸如负载不平衡之类的问题。本文探讨了Charm ++编程模型中的处理器虚拟化，并采用可移植对象对petaflops类机器进行编程，并行仿真支持算法验证，并行仿真支持性能预测，并使用新型的自动负载平衡策略来解决编程中的许多挑战很大的机器。重要的是要了解大型并行机器上并行应用程序的性能。本文探讨了并行离散事件模拟技术，以模拟并行应用程序并预测其性能。我们提出了一种新颖的乐观同步协议，该协议利用并行应用程序中的固有确定性来有效减少同步开销。负载平衡问题对应用程序提出了巨大挑战，以在大型计算机上实现可伸缩性。我们研究负载平衡技术，并开发由多种实际应用激发的负载平衡策略。我们在多个维度的标准中优化我们的负载平衡策略，例如，可感知通信的负载平衡，子步骤负载平衡和计算阶段感知的负载平衡。我们已经使用本文提出的负载平衡框架成功地将NAMD（经典分子动力学应用程序）的峰值性能扩展到1 TF，在3000个PSC Lemieux处理器上。我们进一步激发了对petaflops级机器的下一代负载平衡策略的需求。我们探索了可伸缩的分层负载平衡方案的新颖设计，该方案结合了显式的内存成本控制功能，以使其易于适应具有较小内存占用空间的超大型计算机。这种分层的负载平衡方案通过在运行时自动在计算和通信模式上检测应用程序来构建负载数据。负载平衡策略明确考虑了应用程序通信模式。

著录项

作者
Zheng, Gengbin.;
展开▼
作者单位

University of Illinois at Urbana-Champaign.;

展开▼
授予单位 University of Illinois at Urbana-Champaign.;
学科 Computer Science.
学位 Ph.D.
年度 2005
页码 163 p.
总页数 163
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Performance Evaluation of Dynamic Load Balancing Scheme with Load Prediction Mechanism Using the Load Growing Acceleration for Massively Parallel Computers [J] . Atsushi Nunome, Hiroaki Hirata, Haruo Niimi, Systems and Computers in Japan . 2004,第11期

机译：大规模并行计算机基于负载预测机制的带有负载预测机制的动态负载均衡方案的性能评估
2. A highly OR-parallel inference machine (Multi-ASCA) and its performance evaluation: an architecture and its load balancing algorithms [J] . Naganuma J., Ogura T. IEEE Transactions on Computers . 1994,第9期

机译：高度或并行的推理机（Multi-ASCA）及其性能评估：体系结构及其负载平衡算法
3. Performance Prediction of a Series-parallel and Multi-Product Production Line with Unreliable Machines, Finite Buffers and Nonconforming Products [J] . Li Changjun, Li Bo Journal of the Chinese Society of Mechanical Engineers, Series C: Transactions of the Chinese Society of Mechanical Engineers . 2018,第3期

机译：具有不可靠的机器，有限缓冲区和不合格产品的系列平行和多产品生产线的性能预测
4. BigSim: a parallel simulator for performance prediction of extremely large parallel machines [C] . Zheng, G., Gunavardhan Kakulapati, . 2004

机译：BigSim：并行模拟器，用于预测超大型并行机的性能
5. Achieving enhanced performance for vector-parallel isosurface generation. [D] . Emani, Pavan. 2004

机译：实现矢量并行等值面生成的增强性能。
6. Ultra-deep massively parallel sequencing with unique molecular identifier tagging achieves comparable performance to droplet digital PCR for detection and quantification of circulating tumor DNA from lung cancer patients [O] . Le Son Tran, Hong-Anh Thi Pham, Vu-Uyen Tran, 2019

机译：具有独特分子标识符标记的超深度平行测序可与肺癌患者循环肿瘤DNA的检测和定量液滴数字PCR实现了相当的性能
7. Achieving High Performance on Extremely Large Parallel Machines: Performance Prediction and Load Balancing [O] . Zheng Gengbin 2005

机译：在超大型并行机上实现高性能：性能预测和负载均衡
8. Machine Learning Based Online Performance Prediction for Runtime Parallelization and Task Scheduling. [R] . B. R. De Supinski J. Li K. Singh M. Schulz X. Ma 2008

机译：基于机器学习的运行时并行化和任务调度在线性能预测。

Achieving high performance on extremely large parallel machines: Performance prediction and load balancing.

摘要

著录项

相似文献

相关主题

期刊订阅